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SECRETED POLYPEPTIDE SPECIES (FRAGMENTS FROM CHITOTRIOSIDASE) 
REDUCED IN CARDIOVASCULAR DISORDERS 

FIELD OF THE INVENTION 
The invention relates to secreted polypeptide species absent £rom the plasma of individuals 
with cardiovascxdar disorders, but present in the plasma of individuals free from the disease, isolated 
polynucleotides encoding such polypeptides, polymorphic variants thereof and the use of said 
nucleic acids and polypeptides or compositions thereof in detection assays, for cardiovascular 
disorder diagnosis, for. cardiovascular disorder treatment and for drug development. 

BACKGROUND 

Cardiovascular disease is a major health risk throughout the industrialized world. Coronary 
Artery Disease (CAD) is characterized by atherosclerosis or hard^iing of the arteries. 
Atherosclerosis is the most prevalent of cardiovascular diseases, is the principal cause of heart 
attack, stroke, and gangrene of the extremities, and thereby the principle cause of death in the United 
States. Atherosclerosis is a complex disease involving many cell types and molecular &ctors 
(described in, for example, Ross, 1993, Nature 362: 801-809). In normal circumstances a protective 
response to insults to the ^dotfaelium and smooth muscle cells (SMCs) of the wall of the artery 
consists of Ihe formation of fibro&tty and fibrous lesions or plaques, preceded.and accompanied by 
inflammation. The advanced lesions of atherosclerosis may occlude the artery concmied, and result 
from an excessive inflaimnatory-fibroproliferative response to numerous different forms of insult. 
Injury or dysfunction of the vascular endothelium is a common feature of many conditions that 
predispose an individual to accelerated development of atherosclerotic cardiovascular disease. 

Atherosclerotic plaques occlude the blood vessel concerned and restrict the flow of blood, 
resulting in ischemia. Ischemia is a condition characterized by a ladk of oxygen supply in tissues of 
organs d\ie to inadequate perfusion. Such inadequate perfusion can have a number of natural causes, 
including atherosclerotic or restenotic lesions, anemia, or stroke. The most common cause of 
ischemia in the heart is atherosclerotic disease of epicardial coronary arteries. By reducimg the lumen 
of these vessels, atherosclerosis causes an absolute decrease in myocardial perfusion in the basal 
state or limits appropriate increases in perfusion whqa the demand for flow is augmented. Coronary 
blood flow can also be limited by arterial thrombi, spasm, and,' rarely, coronary emboli, as well as by 
ostial narrowing due to luetic aortitis. Congenital abnormalities, such as anomalous origin of the left 
anterior descmding coronary artery from the pulmonary artery, may cause myocardial ischemia and 
in&rction in in&ncy, but this cause is very rare in adults. 

Myocardial ischemia can also occur if rrr/ocardial oxygen demands are abnormally 
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increased, as in severe ventricular hypertrophy due to hypertension or aortic stmosis. The latter can 
be pres^ with angina that is indistinguishable Scorn that caused by coronary adierosclerosis. A 
reduction in the oxygen-carrying capacity of the blood, as in extremely severe anemia or in the 
presence of carboxy-hemoglobin, is a rare cause of myocardial ischemia. Not iofrequently, two or 
more causes of ischemia will coexist, such as an increase in oxygen demand due to left ventricular 
hypertrophy and a reduction in oxygen supply secondary to coronary atherosclerosis. 

JExt©Qsiyia.clinic^studies.hayejdentii5ed-fectors.th^ increase the risk of cardiovascular 
disorders. Some of these risk fectors, such as age, gender, and &niily history cannot be changed. 
Other risk factors include the following: smoking, high blood pressure, high fet and high cholesterol 
diet, diabetes, lack of exercise, obesity, and stress. 

Fortunately, many contributing &ctors are controllable through lifestyle dhaiiges. The risk 
of cardiovascular disorders for smokers is more than twice that of non-smokers. When a person 
stops smoking, regardless of how much he or she may have smoked in the past, their risk of 
developing a disorder rapidly declines. Serum cholesterol level is directly related to prevalence of 
cardiovascular disorder and hypertension or high blood pressure is an important risk factor. 
Physical activity has been postulated to reduce the risk of developing a cardiovascular disorder 
through various mechanisms: it increases myocardial oxygen supply, decreases oxygen demand, and 
improves myocardial contraction and its electrical impulse stability. Reduced oxygen demand and 
myocardial work are rdElected in lowered heart rate and blood pressure at rest. Physical activity also 
increases the diameter and dilatory capacity of coronary arteries, increases collateral artery 
formation, and reduces rates of progression of coronary artery a&erosclerosis. Obesity and the 
serum fetty acids are reduced by activity. 

There may be no noticeable symptoms of a cardiovascular disorder at rest, but symptoms 
such as chest pressiu-e may occur with increased activity or stress. Other first signs that can appear 
are heartburn, nausea, vomiting, numbness, shortness of breath, heavy cold sweating, unexplained 
fatigue, and feelings of anxiety. The more severe symptoms of cardiovascular disorders are chest 
pain (angina pectoris), rhythm disturbances (arrhythmias), stroke, or heart attack (myocardial 
inferction). Strokes and heart attacks result from a blocked artery in the brain and heart tissue, 
respectively. Because symptoms vary, the tests and treatments chosen can be very different firom one 
patient to another. 

Diagnostic tests useful in determining the extent and severity of cardiovascular disorder 
include: electrocardiogram (BKG), stress test, nuclear scanning, coronary angiography, resting EKG, 
EKG Multiphase Information Diagnosis Indexes, Holter monitor, late potentials, EKG mapping, 
echocardiogram. Thallium scan, PET, MRI, CT, angiogram and IVUS. Additional risk £ictor 
measures and useful diagnostics are common and best appUed by one of skill in the art of medicine. 
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There are many different therapeutic approaches, depending on the seriousness of the disease. For 
many people, cardiovascular disorders are managed with lifestyle changes and medications. More 
severe diagnoses may indicate a need fi>r surgery. 

Surgical approaches to the treatment of ischemic atherosclerosis include bypass grafting, 
5 coronary angioplasty, laser angioplasty, atherectomy, endartCTectomy, and percutaneous 

translumenal angioplasty (PCTA). The feilure rate after these approaches due to restenosis, in which 
the occlusions recur and often become even worse, is extraordinarily high (30-50%). It appears that 
much of the restenosis is due to further inflammation, smooth muscle accumulation, and thrombosis. 
Additioxial therapeutic approaches to cardiovascular disease have included treatments that 
10 encouraged angiogenesis and tissue remodeling in such conditions as ischemic heart and limb 
disease. 

The non-specific nature of most CAD and cardiovascular disorder symptoms makes 
definitive diagnosis difficult. More quantitative diagnostic methods suffer from variabiUty, both 
between individuals and between readings on a single individual. Thxis, diagnostic measures must be 
15 standardized and applied to iadividuals with well-documented and extensive medical histories. 

Further, current diagnostic methods often do not reveal the underlying cause for a given observation 
or reading. Thereft)re, a therapeutic strategy based on a particular positive result likely will not 
address the causative problem and may even be harmfiil to the individual. 

Methods of diagnosis that rely on nucleotide detection include genetic approaches and 
20 expression profiling. For example, genes that are known to be involved in cardiovascular disorders 
may be screened for mutations using common genotyping techniques such as sequencing, 
hybridization-based techniques, or PGR. In another example, expression from a known gene may be 
tracked by standard techniques including RTPCR, various hybridization-based techniques, and 
sequencing. These strategies often do not enable a practitioner to detect differences in mRNA 
25 processing and splicing, translation rate, mRNA stability, and posttranslational modifications such 
. as proteolytic processing, phosphorylation, glycosylation, and amidation. 

To address the current weaknesses in the diagnostic state of the art for cardiovascular 
disorders, the invention provides polypeptides relevant to cardiovascular disorders. The invention 
provides the identity of polypeptide species that are undetectable in plasma from individuals with 
30 Coronary Artery Disease but present in control plasma. By providing the actual polypeptide species, 
differences in mKNA processing and splicing, translation rate, mRNA stability, and posttranslational 
modifications such as proteolytic processing, phosphorylation, glycosylation, and amidation are 
revealed. The polypeptides of the invention are thus described as "Cardiovascular disorder Plasma 
Polypeptides" or CPPs. These polypeptide sequences are described as SEQ ID NOs:3-4, and those 
35 comprising at least one of the amino add sequences selected from the tryptic peptide of Table 1 . 



3 
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TTie present invenlion discloses "Cardiovascular disorder Plasma Polypeptides" (CPPs), 
fragments, and post-translationally modified species of CPPs that are reduced in the plasma obtained 
from individuals with Coronary Artery Disease (CAD). Preferred fragments of the invention are 
those described as SEQ ID NOs:3-4. Thus, Ibe CPPs of the invention represent an important 

5 diagnostic tool for detennining the risk of CAD, coronary heart disease (CHD), peripheral vascular 
disease, cerebral ischemia (stroke), congestive heart failure, atherosclerosis, hypertension, and other 
cardiovascular diseases. CPPs are secreted fectors and as such, are ideal candidates for protein- 
based therapies. For dosage modulation in a clinical setting, protein therapy is preferable to genetic 
therapy, which is hampered by the lack of finely regulable expression. Further, as secreted factors, 

10 the polypeptide species of the invention are easy to target, e.g., with a small molecule or protein 

modulator. Thus, the polypeptide species of the invention are usefiil for drug development and design 
of therapeutic strategies to prevent and treat cardiovascular disease. 

SUMMARY OF THE PfV^NTION 

15 The present invention is directed to compositions related to secreted, active polypeptide 

species that, while present in normal human plasma, are undetectable in plasma from individuals 
with a cardiovascular disorder. These polypeptide species are designated herein "Cardiovascular 
disorder Plasma Polypeptides," or CPPs. In particular, preferred CPPs of the invention are 
Augments, described as SEQ ID NO:3-4, and designated herein CPP 6 and CPP 7. Compositions 

20 include CPP precursors, antibodies specific for CPPs, including monoclonal antibodies and other 
binding compositions derived therefrom. Further included are methods of making and using these 
compositions. Precursors of liie invention include unmodified precursors, proteolytic precursors of 
SEQ ID NOs:l-5, and intermediates resulting from alternative proteolytic sites in the amino acid 
sequences of SEQ ID NOs:l-5. 

25 A preferred embodiment of the invention includes CPPs having a posttranslational 

modification, such as a phosphorylation, glycosylation, acetylation, amidation, or a C-, N- or O- 
linked carbohydrate group. Additionally preferred are CPPs with intra- or mter-molecular 
interactions, e.g., disulfide and hydrogen bonds, that result in higher order structures. Also preferred 
are CPPs that result from differential mRNA processing or splicing. Preferably, the CPPs represent 

30 posttranslationafly modified species, structural variants, or splice variants that are differentially 
reduced in the plasma 6om individuals having or at risk of developiog a cardiovascular disorder. 

In another aspect, the invention includes CPPs conq)risiag a sequence yMch is at least 95 
percent identical to a SEQ ID NOs:3-5, Preferably, the invention includes polypeptides comprising 
at least 97 percent, and more preferably at least 98 percent, and still more preferably at least 99 
35 percent, identity with any one of the sequences selected from SEQ ID NOs:3-5. Most preferably, the 

4 
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invmdon includes polypeptides conq)risi]ig a sequence at least 99 percmt identical to a SEQ ID 
NOs:3-5. 

In another aspect, the invention includes natural variants of CPPs having a fiequency in a 
selected population of at least two percent. More preferably, such natural variant has a frequency in 
5 a selected population of at least five percent, and still more preferably, at least ten percent. Most 
preferably, such natural variant has a frequency in a selected population of at least twenty percent 
The selected population may be any recognized population of study in the field of population 
genetics. Preferably, the selected popxilation is Caucasian, Negroid, or Asian. More preferably, the 
selected population is French, German, English, Spanish, Swiss, Japanese,- Chinese, Irish, Korean, 

10 Singaporean, Icelandic, North American, Israeh, Arab, Turkish, Greek, ItaUan, Polish, Pacific 

Islander, Finnish, Norwegian, Swedish, Estonian, Austrian, or Indian. More preferably, the selected 
population is Icelandic, Saanoi, Finnish, French of Caucasian ancestry, Swiss, Singaporean of 
Chinese ancestry, Korean, Japanese, Quebecian, North American Pima Indians, Pennsylvaman 
Amish and Amish Mennonite, Newfoundlander, or Polynesian. 

15 A preferred aspect of the invmtion provides a composition comprising an isolated CPP, i.e., 

a CPP free from proteins or protein isoforms having a significantly diflfoent isoelectric point or a 
significantly different apparent molecular weight from tiie CPP. The isoelectric point and molecular 
weight of a CPP may be indicated by affinily and size-based separation chronaatography, 2- 
dimensional gel analysis, and mass spectrometry. 

20 In a piefened aspect, the invention provides particular polypq)tide species that comprise an 

amino add sequence selected from the group consisting of SEQ ID NOs:3-5. Preferably, iho 
particular polypqptide species fiuther comprises contiguous anuno acid sequence from the 
appropriate fiiU length sequence selected from the group consisting of SEQ ID NOs: 1 and 2. 
Preferred species are polypeptides that i) comprise an amino acid sequence selected from the group 

25 consisting of SEQ ID NOs:3-5; ii) appear at a higher level in plasma from control individuals 
without a cardiovascular disorder; and iii) result from proteolytic processing of a full length 
polypeptide sequaace selected from the group consisting of SEQ ID NOs. l and 2. Particularly 
preferred polypeptides are the CPP 6 (SEQ ID N0:3) and CPP 7 (SEQ ID NO:4). 

In an additional aspect, the inv^tion includes modified CPPs. Such modifications include 

30 protecting/blocking groups, linkage to an antibody molecule or other cellxilar Kgand, and detectable 
labels, such as an enzymatic, fluorescent, isotopic or affinity label to allow for detection and 
isolation of the protein. Chemical modifications may be carried out by known techniques, including 
but not limited, to specific chemical cleavage by cyanogen bromide, tiypsin, chymotiypsin, papain, 
V8 protease, NaBH4, acetylation, fonnylation, oxidation, reduction, or metabolic synthesis in the 

35 presence of tunicamycin. 

5 
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Also provided by the inventioa are chemically modified derivatives of the polypeptides of the 
invention ^^di may provide additioiial advantages such as increased solubility, stability and 
circulating time of the polypeptide, or decreased immuhogenicity (e.g., water soluble polymers sudi 
as polyethylene glycol, ethylene glycol/propylene glycol copolymers, carboxymethylcellulose, 
5 dextran, polyvinyl alcohol). The CPPs are modified at random positions within the molecule, or at 
predetermined positions within the molecule and may include one, two, three or more attached 
chemical moieties. 

In another embodiment, the invention provides a method of identifying a modulator of at 
least one CP? biological activity comprising the steps of: i) contacting a test modulator of a CPP 

10 biological activity with the polypeptide comprising the amino acid sequence selected fi-om the group 
consisting of SEQ ID NOs: 1-5; ii) detecting the level of said CPP biological activity; and iii) 
comparing the level of said CPP biological activity to that of a control sample laddsg said test 
modulator. Where the difference in the level of CPP protein biological activity is a decrease, the test 
modulator is an inhibitor of at least one CPP biological activity. Where the difference in the level of 

15 CPP biological activity is an increase, the test substance is an activator of at least one CPP 
biological activity. 

In another aspect, the mvendon includes polynucleotides encoding a CPP of the invention, 
polynucleotides encoding a polypeptide.having an amino add sequmce selected fi-om the group 
consisting of SEQ ID NOs: 1-5, antisense oligonucleotides complementary to such sequences, 

20 oligonucleotides complementary to CPP gene sequences for diagnostic and analytical assays (e.g., 
PCR, hybridization-based techniques), and vectors for expressing CPPs. 

In another aspect, the invention provides a vector comprising DNA encoding a CPP. The 
invention also includes host cells and transgenic nonhuman animals comprising such a vector. 
There is also provided a method of making a CPP or CPP precursor. One preferred method 

25 comprises the steps of (a) providing a host cell containing an expression vector as disclosed above; 
(b) culturing the host cell under conditions whereby the DNA segment is expressed; and (c) 
recovering the protein encoded by the DNA segtn^t. Another preferred method comprises the steps 
of (a) providing a host cell capable of expressing a CPP; (b) culturing said host cell under 
conditions that allow expression of said CPP; and (c) recovering said CPP. Within one embodiment 

30 the expression vector fiuther comprises a secretory signal sequence operably linked to the DNA 
segment, the cell secretes the protein into a culture medium, and the protein is recovered firom the 
medium. An especially preferred method of making a CPP includes chemical synthesis using 
standard peptide synthesis techniques, as described in the section titled '"Chemical Manu&ctuie of 
CPP Compositions'' and in Example 3 . 

35 In another aspect, the invention includes isolated antibodies specific for any of the 
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polypeptides, peptide firagments, or peptides described above. Preferably, the antibodies of the 
invention axe monoclonal antibodies. Further preferred are antibodies that bind to a CPP 
exclusively, that is, antibodies that do not recognize other polypeptides, e.g., tiiose not derived from a 
polypeptide selected from the group consisting of SEQ ID NOs: 1-5. Anti-CPP antibodies have 
5 purification, diagnostic and therapeutic applications, particularly in treating CPP-related disorders. • 
Preferred anti-CPP antibodies for purification and diagnosis are attached to a label group. Preferred 
CPP-related disorders for diagnosis include coronary artery disease (CAD), coronary heart disease 
(CHD), peripheral vascular disease, cerebral ischemia (stroke), congestive heart feilure, 
atherosclerosis, hypertension, and other cardiovascular diseases. Treatment and diagnostic mefliods 
10 include, but are not limited to, those that employ antibodies or antibody-derived compositions 

specific for a CPP antigen. Diagnostic methods for detectmg CPPs in specific tissue samples and 
biological fluids (preferably plasma), and for detecting levels of expression of CPPs in tissues, also 
form part of the invention. Compositions comprising one or more antibodies described above, 
together with a phannaceutically accq)table carrier are also within the scope of the invention. 
15 The invention fiulher provides methods for diagnosis of cardiovascular disorders that 

comprise detecting in a sample of body fluid, preferably blood plasma, the presence or level of at 
least one CPP disclosed herein or any combination thereof These methods are also suitable for 
dimcal screening, prognosis, monitoring the results of therapy, identtfymg patients most likely to 
respond to a particular therapeutic treatment, drug screening and development, and identification of 
20 new targets for drug treatment. 

The invention provides kits that may be used in the above-recited methods and tiiat may 
comprise single or multiple preparations, or antibodies, together with other reagents, label groups, 
substrates, if needed, and directions for use. The kits may be used for diagnosis of disease, or may be 
assays for the idratification of new diagnostic and/or therapeutic agents. 
25 In one embodiment. Coronary Artery Disease (CAD) is defined by the appearance of at least 

one symptom. Such symptoms become more serious as the disease progresses. CAD is oflm 
accompanied by reduced left ventricle capacity or output. Early CAD symptoms include elevated 
plasma levels of cholesterol and low-density lipoprotein (especially oxidized forms), as well as 
platelet-rich plasma aggregations. The vascular endothelium responds to inflammation and thus 
30 formation of plaques and levels of inflammatory and fibrinogenic fectors increase. In addition, CAD, 
or atherosclerosis, is characterized by vascular calcification and hardening of the arteries. The 
resulting partial occlusion of the blood vessels leads to hypertension and ischemic heart disease. 
Eventual complete vascular occlusion results in myocardial inferction, stroke, or gangrene. 

In a preferred embodiment, detection of reduced plasma levels of at least one CPP of the 
35 invention indicates an increased risk that an individual will develop CAD. Preferably, said detection 

7 
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indicates that an individual has at least a LOS-&ld, l.l-fi>ld, 1.15-fold, and more preferably at least 
a 1 .2-fold increased likelihood of developing CAD. Alternatively, detection of reduced plasma levels 
of at least one CPP of Ihe invention indicates that an individual has CAD. The amount of CPP 
decrease observed in an individual plasma sample compared to a control sample will correlate with 
the certainty of the prediction or diagnosis of CAD. As individual plasma CPP levels will vary 
depending on femily history and other risk fectors, each will preferably be examined on a case-by- 
case basis. In preferred embodiments, CPP is detected in a human plasma sample by the metiiods of 
the invention. Especially preferred techniques are mass spectrometry and immxmodetection. 
Preferably, a prediction or diagnosis of CAD is based on at least a 1 . 1-, 1 . 15-, 1 .2-, 1 .25-, and more 
preferably a 1 .5-fold decrease m the e3q)erimental CPP level as compared to the control. 

The invention ferther includes methods of using CPP con^ositions to treat disorders 
associated with aberrant expression or processing of CPPs of SEQ ID NOs:3-4 in an individual. 
Preferred CPP-related disorders include coronary artery disease (CAD), coronary heart disease 
(CHD), peripheral vascular disease, cerebral ischemia (stroke), congestive heart feilure, 
atherosclerosis, hypertension, and other cardiovascular diseases. Further included are methods of 
using CPi? compositions, including primers complementary to CPP genes and/or messenger RNA 
and anti- CPP antibodies, for detecting and measuring quantities of CPPs in tissues and biological 
fluids, preferably plasma. The invention further includes methods of screening for compounds that 
increase the e?q)ression of CPPs, as well as methods of screening for cornpoimds that interact with 
and/or increase the activity of CPPs. The invention fiirther includes methods of screening for 
compounds that inhibit the e3q)ression of CPPs, as well as methods of screening for compounds that 
interact with and/or inhibit the activity of CPPs. The invention further encompasses con?>ounds thus 
identified, as well as compositions thereof. 

In still a further aspect, the invention includes pharmaceutical compositions and 
formulations comprising a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ED NOs:l-5, and a pharmaceutically acceptable carrier conq)Ound. 

Further aspects of the invention are also described in the specification and in the claims. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTING 
SEQ ID N0:1 represents the amino acid sequence of fiill-length Chitotriosidase (SwissProt 
Q13231). 

SEQ ID N0:2 represents the amino acid sequence of the mature Chitotriosidase polypeptide. 
SEQ ID N0s:3 and 4 are the amino acid sequences of plasma proteins that are absent or 
reduced in the plasma of individuals with coronary artery disease, which are herein designated CPP 6 

8 
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and CPP 7, respectively. 

SEQ ID N0:5 describes the amino acid sequence of the tryptic peptide discovered by MS- 
MS and/or MS-MALDI mass spectrometry ni plasma samples of individuals without coronary artery 
disease. 

5 

BRIEF DESCRIPTION OF THE TABLE 

Table 1 lists the tryptic peptide found in fractionated plasma of control individuals without 
Coronary Artery Disease (CAD) and absent in Ihe plasma of patieats with CAD. The column 
labelled CEX indicates in which of the 18 cation exchange fractions the tryptic peptide was eluted, 
10 and the cohmm labelled Salt indicates the NaCl concentration for the elution of these fractions, 

according to the protocol described in Step 3 of Example 1 herein. RPl refers to the reverse phase 
fractions (fractions 1-30), and %B indicates the percentage of elution buffer for these fractions, 
according to the protocol described in Step 4 of Example 1 herein. The second reverse phase fraction 
(fractions 1-24) is indicated as the last two digits of the Run Number. 

15 

BBTEF DESCRIPTION OF THE FIGURE 
Figure 1 shows the sequences of human Chitotriosidase preprotein (SEQ ID NO:l); mature 
Chitotriosidase polypeptide (SEQ ID NO:2); and Cardiovascular disorder Plasma Polypeptides 6-7 
(SEQ ID NOs:3-4). The tryptic peptide observed in control plasma samples by tandem mass 
20 spectrometry is in bold and illustrated as SEQ ID NO:5. The signal sequence is underlined. Dibasic 
proteolytic cleavage sites are denoted by italic and underlined letters. The amino add sequences of 
the resulting processed p^tides, CPP 6-7 (SEQ ID NOs:3-4), are listed below. The putative active 
site of the protein is shown in bold and underlined (Boot, R.G. et al, J Biol Chem., (1995), 270, 
p.26252-56). 

25 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention described in detail below provides methods, compositions, and kits 
useful for screeiung, diagnosis, and treatment of a cardiovascular disorder in a mammalian 
individual; for identifying individuals most likely to respond to a particular therapeutic treatment; for 

30 monitoring the results of cardiovascular disorder therapy; for screening CPP modulators; and for 

drug development. When the invention is used for drug development, e.g., to determine the ability 6f 
a CPP modulator or drug candidate to induce an anti-cardiovascular disorder response, the body 
fluid analyzed for the level of at least one CPP is preferably from a non-human mammal. The non- 
human itiytmtnal is preferably one in \rtiich the mduction of an anti-cardiovascular disorder response 

35 by endogenous and/or oogenous agents is predictive of the induction of such a response hi a human. 

9 
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Rodeirts (mice, rats, etc.) and primates are particularly suitable for use in this aspect o£ ^ 
invention. The invention also eaconq)ass6S the administration of theFapeotic compositions to a 
tnflTTwnali'ati individual to treat or prevent cardiovascular disorders. The mairanalian individual may 
be a non-human mammal, but is preferably human, more preferably a human aduh. For clarity of 
disclosure, and not by way of limitation, the invention will be described with respect to the analysis 
of blood plasma samples. However, as one skilled in the art will appreciate, the assays and 
techniques described below can be applied to other biological fluid samples (e.g. cerebrospinal fluid, 
lymph, bile, serum, saliva or urine) or tissue samples from an individual at risk of having or 
developing a cardiovascular disorder. The methods and compositions of the present invention are 
useful for screening, diagnosis and prognosis of a living individual, but may also be used for 
postmortem diagnosis in an individual, for example, to identify femily members who are at risk of 
developing the same disord^. 

Deftmtions 

As used herein, the term "nucleic acids" and "nucleic acid molecule" is intended to include 
DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs of 
the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule can be single- 
stranded or double-stranded, but preferably is double-stranded DNA. Throughout the present 
specification, &e expression '^nucleotide sequence" may be employed to designate indifferently a 
polynucleotide or a nucleic acid. More precisely, the ejqpression '"nucleotide sequence" encompasses 
tiie nucleic material itself and is thus not restricted to the sequence injformation (i.e. the succession of 
letters chosen among the four base letters) that biochemically characterizes a specific DNA or KNA 
molecule. Also, used interchangeably herein are terms "nucleic acids", "oligonucleotides", and 
"polynucleotides". 

An "isolated" nucleic acid molecule is one which is separated from other nucleic acid 
molecules which are present in the natural source of the nucleic acid. Preferably, an "isolated" 
nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., sequences located at the 
5* and 3* ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid 
is derived. For example, in various embodiments, the isolated CPP nucleic acid molecule can contain 
less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0. 1 kb of nucleotide sequences which naturally 
flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. 
Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free 
of other cellular material, or culture medium when produced by recombinant techniques, or 
substantially fi^ of chemical precursors or other chemicals when chemically synthesized. A nucleic 
acid molecule of the present invaitipn can be isolated using standard molecular biology techniques 

10 
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and the sequence iiifonnation provided herein. Using all or a portion ofthe nucleic add, as a 

hybridization probe, CPP nucleic acid molecules can be isolated using standard hybridization and 
cloning techniques (e.g., as described m Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular 
Cloning. A Laboratory Manual. 2nd, ed.. Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989). 

As used herein, the term "hybridizes to" is intended to describe conditions for moderate 
stringency or high stringency hybridization, preferably vfheA the hybridization and washing 
conditions permit nucleotide sequences at least 60% homologous to each other to remain hybridized 
to each other. Preferably, the conditions are such that sequences at least about 70%, more preferably 
at least about 80%^ even more preferably at least about 85%, 90%, 95% or 98% homologous to each 
other typically remain hybridized to each other. Stringent conditions are known to those skilled in 
the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. 
(1989), 6.3.1-6.3.6. In a preferred, non-limiting example, stringent hybridization conditions for 
nucleic add interactions are as follows: the hybridization step is realized at 65**C in the presence of 6 
X SSC buffer, 5 x Denhardt's solution, 0,5% SDS and lOO^g/ml of salmon sperm DNA. The 
hybridization step is followed by four washing st^s: 

. two washings during 5 min, preferably at 65^C in a 2 x SSC and 0.1%SDS buffer; 
. one washing during 30 min, preferably at 65X in a 2 X SSC and 0.1% SDS buffer, 
- one washing during 10 min, preferably at 65X in a 0.1 x SSC and 0.1%SDS buffer, 
these hybridization conditions being suitable for a nucleic acid molecule of about 20 nucleotides in 
iCTgth. It will be appreciated that the hybridization conditions described above are to be adapted 
according to tiie length of the desired nucleic acid, following techniques well known to the one skilled 
in the art, for example be ad^rted according to the teachings disclosed in Hames B.D. and Higgins 
S.J. (1985) Nucleic Acid Hybridization: A Practical Approach. Hames and Higgins Ed., IRL Press, 
Oxford; and Current Protocols in Molecular Biology (supra). 

*Tercent homology" is used herein to refer to both nucleic acid sequences and amino acid 
sequences. As used herein, amino acid or nucleic acid "identity" is equivalent to amino acid or 
nucleic acid "homology". To determine the percent homology of two amino acid sequences or of two 
nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., g^s can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a 
second amino or nucleic acid sequence and non-homologous sequences can be disregarded for 
comparison purposes). The length of a reference sequence aligned for comparison purposes is at 
least 30%, preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, 
and even more preferably at least 70%, 80%, 90% or 95% of the lengtii of tiie reference sequence. 
The amino acid residues or nucleotides at corresponding amino add positions or nucleotide positions 

11 
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aie tiien coiqpared. When a position in the first sequence is occupied by the same amino acid residue 

or nucleotide as Ifae corresponding position in the second sequence, then the molecules are 
homologous at that position. The percmt homology between the two sequences is a function of the 
number of identical positions shared by the sequences (i.e., % homology=# of identical 
positions/total # of positions 100). 

The comparison of sequences and determination of percent homology between two sequences 
can be accomplished using a mathematical algorithm. A preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequmces is the algorithm of KarUn and 
Altschul (1990) Proc. Natl. Acad, Sci. USA 87:2264-68, modified as in Karlin and Altschul (1993) 
Proc. Natl. Acad. Sci. USA 90:5873-77, the disclosures of which are incorporated herein by 
reference in their entireties. Such an algorithm is incorporated into the NBLAST and XBLAST 
programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide 
searches can be performed with the NBLAST program, score=100, wordlengdi=12 to obtain 
nucleotide sequences homologous to the sequences of the invention. BLAST protein searches can be 
performed with the XBLAST program, scor^SO, wordlength=3 to obtain amino acid sequences 
homologous to the polypeptide sequences of the invention. To obtain gapped alignments for 
comparison purposes. Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic 
Acids Research 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the 
defeult parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See 
http y/www.ncbi.nhn.nih.gov, tiie disclosures of which are mcorporated herein by reference in their 
entireties. Anotiber preferred, non-limiting exanqple of a mathematical algorithim utilized for the 
comparison of sequences is the algoriflmi of Myers and Miller, CABIOS (1989), the disclosures of 
which are incorporated herein by reference in thek entireties. Such an algorithm is incorporated into 
the ALIGN program (version 2.0) which is part of the GCG sequence alignment software package. 
When utihzmg the ALIGN program for comparing amino acid sequences, a PAM120 weight residue 
table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

The term "polypeptide" refers to a polymer of amino acids without regard to the length of 
the polymer; thus, peptides, oligopeptides, and proteins are included within the definition of 
polypeptide. This term also does not specify or exclude post-translational modifications of 
polypeptides, for example, polypeptides which include the covalent attachment of glycosyl, acetyl, 
phosphate, amide, lipid, carboxyl, acyl, or carbohydrate groups are expressly encompassed by the 
term polypeptide. Also included within the definition are polypeptides which contain one or more 
analogs of an amino acid (iacluding, for example, non-naturally occurring amino acids, amino acids 
which only occur naturally in an unrelated biological system, modified amino acids from mammalian 
systems etc.), polypeptides with substituted linkages, as well as other modifications known in the art, 

12 
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both naturally occurring and non-naturally occurring. 

The term **pTotem'* as used herein may be used synonymously witb the term **polypeptide" or 
nmy refer to, in addition, a complex of two or more polypeptides wMchnm other 
than peptide bonds, for example, such polypeptides makmg \sp the protein may be linked by disulfide 
bonds. The tenn *^rotein'* may also comprehend a &mily of polypeptides having identical amino 
acid sequences but different post-translational modifications, particularly as may be added when 
such proteins are expressed in eukaryotic hosts. 

An "isolated" or "piuified" protein or biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from the cell or tissue source from which the 
protein of the invention (i.e., CPP or biologically active fragment thereof) is derived, or substantially 
free from chemical precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of a protein according to the invention 
in which the protein is separated from cellxilar ciomponents of the cells from which it is isolated or . 
recombinantly produced. In one embodiment, the language "substantially free of cellular material" 
includes preparations of a protein according to the invention having less than about 30% (by dry 
weight) of protein other than the protein of the invention (also referred to herein as a "contaminating 
protein"), more preferably less than about 20% of protein other than the protein according to the 
invention, still more preferably less than about 1 0% of protein other than the protein accordii^ to the 
invention, and most preferably less than about 5% of proteia other than the protein according to the 
invention. When the protein according to the urvention or biologically active porticm thereof is 
recombmantly produced, it is also preferably substantially free of culture medium, i.e., culture 
medium represents less than about 20%, more preferably less than about 10%, and most preferably 
less than about S% of the volume of the protein preparation. 

The language "substantially firee of diemical precursors or other chemicals" includes 
preparations of a protein of the invention in which the protein is separated from chemical precursors 
or other chemicals which are involved in the synthesis of the proteia. In one embodiment, the 
language "substantially free of chemical precursors or other chemicals" includes preparations of a 
proteia of the invention having less than about 30% (by diy weight) of chemical precursors or non- 
protein chemicals, more preferably less than about 20% chemical precursors or non-protein 
chemicals, still more preferably less than about 10% chemical precursors or non-proteia chemicals, ' . 
and most preferably less than about 5% chemical precursors or non-protein chemicals. 

The term "recombinant polypeptide** is used herein to refer to polypeptides that have been 
artificially designed and which comprise at least two polypeptide sequences that are not found as 
contiguous polypeptide sequences in tfieir initial natural environment, or to refer to polypeptides 
which have been expressed &om a recombinant polynucleotide. 
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The term "Cardiovascular disorder Plasma Polypeptide" or "CPP" refers to a polypeptide 
conq)Tising flie sequence described by any one of SEQ ID NOs:3-5, preferably, conq>rising the 
polypeptide sequence of SEQ ID N0:5 . Such polypeptide may be post-transladonally modified as 
described herein. CPPs may also contain oilier stractural or chemical modifications such as disulfide 
5 linkages or amino acid side chain interactions such as hydrogen and amide bonds that result in 

complex secondary or tertiary structures. CPPs also include mutant polypeptides, such as deletion, 
addition, swap, or truncation mutants, fusion polypeptides comprising such polypeptides, and 
. polypeptide fiagments of at least three, but preferably 8, 10, 12, 15, or 21 contiguous amino acids of 
a sequence selected from the group consisting of SEQ ID NOs:3-5. Further included are CPP 
10 proteolytic precursors and intermediates of the sequence selected from the group consisting of SEQ 
ID NOs: 1-5 . The invention embodies polypeptides encoded by the nucleic acid sequences of CPP 
genes or CPP mRNA species, preferably human CPP genes and mRNA species, including isolated 
CPPs consisting of, consisting essentially o^ or conq>rising the sequence of any one of SEQ ID 
NOs: 1-5. Preferred CPPs have a sequence comprising the sequence of SEQ ID NOs:3-5. Preferred 
15 CPPs retain at least one biological activity of CPPs of SEQ ID NOs:3-4. 

The term '*biolo^cal activity" as used herein refers to any function carried out by a CPP. 
These iuclude but are not limited to: (1) protection from cardiovascular disease; (2) circulating 
through the bloodstream; (3) antigenicity, 9r the ability to bind an anti-CPP specific antibody; (4) 
hmnufaogenicity, or the ability to generate an anti-CPP specific antibocfy; or (5) interacting with a 
20 CPP target molecule, preferably a protein. 

As used herein, a "CPP modulator" or "CPP 6 modulator" or "CPP 7 modulator** is a 
molecule (e.g., polynucleotide, polypeptide, small molecule, or antibody) that is capable of 
modulating (i.e., increasing or decreasiug) either the expression or the biological activity of the CPPs 
of the invention. A CPP modulator that enhances CPP expression or activity is described as a CPP 
25 activator or agonist. Conversely, a CPP modulator that represses CPP expression or activity is 
described as a CPP inhibitor or antagonist. Preferably, CPP modulators increase/ decrease the 
expression or activity by at least 5, 10, or 20%. CPP inhibitors include anti-CPP antibodies, 
fragments thereof, antisense polynucleotides, and molecules characterized by screening assays, as 
described herein. CPP agonists include polynucleotide expression vectors and molecules 
30 characterized by screening assays as described herein. 

A "CPP-related disorder" or "CPP-related disease" describes a cardiovascular disorder. 
Preferred disorders include coronary artery disease (CAD), coronary heart disease (CHD), peripheral 
vascular disease, cerebral ischemia (stroke), congestive heart feilure, atherosclerosis, hypertension, 
and other cardiovascular diseases. Preferably, the likelihood that an individual will develop or 
35 already has such a disorder is indicated by reduced plasma levels of at least one CPP. 
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Another aspect of the invention pertains to anti-CPP antibodies. The tenn "antibody as used 
herein refers to immunoglobulin molecules and immunologically active portions of inununoglobulin 
molecules, i.e., molecules that contain au antigen-binding site \wfaich specifically binds 
(unmunoreacts wiifa) an antigen, such as CPP, or a biologically active fragment or homologue 
thereof. Preferred antibodies bind to one CPP selected from the group consisting of SEQ ID N0s:3-4 
exclusively and do not bind other polypeptide sequences with high affinity. Examples of 
immunologically active portions of immunoglobulin molecules include F(ab) and F(ab')2 fragments 
which can be generated by treating the antibody with an enzyme such as pepsin. The invention 
provides polyclonal and monoclonal antibodies that bkid a CPP, or a biologically active fragment or 
homologue thereof. The term "monoclonal antibody" or "monoclonal antibody composition", as used 
herein, refers to a population of antibody molecules that contaia only one species of an antigen- 
binding site capable of immunoreacting with a particular epitope of a CPP. A monoclonal antibody 
composition thus typically displays a single binding aflBnity Jfor a particular CPP with ^ch it 
immunoreacts. Preferred CPP antibodies are attached to a label group. 

As used herein, a "label group" is any con5)ound that, when attached to a polynucleotide or 
polypeptide (including antibodies), allows for detection or purification of said polynucleotide or 
polypeptide. Label groups may be detected or purified direcfly or indirectiy by a secondary 
con^oimd, including an antibody specific for said label group. Usefiil label groups include 

32 • 35 3 I2S 

radioisotopes (e.g., P, S, H, fluorescent compoimds (e.g., 5-bromodesoxyuridin, 
umbelliferone, fluorescein, fluorescein isothiocyanate, ifaodamine, dichlorotriazinylamine fluorescein, 
dansyl chloride, phycoerythrin acetylaminofluorene, digoxigenin), luminescent compoimds (e.g., 
hmiinol, GFP, luciferin, aequorin), enzymes or enzyme co-factor detectable labels (e.g., peroxidase, 
luciferase, alkaline phosphatase, galactosidase, or acetylcholinesterase), or compoxmds that are 
recognized by a secondary fector such as strepavidin, GST, or biotin. Preferably, a label group is 
attached to a polynucleotide or polypeptide in such a way as to not interfere with the biological 
activity of the polynucleotide or polypeptide. 

Radioisotopes may be detected by direct coimting of radioemission, film e>q)Osure, or by 
scintillation counting, for ©cample. Enzymatic labels may be detected by determination of 
conversion of an appropriate substrate to product, usually causing a fluorescent reaction. 
Fluorescent and luminescent compoimds and reactions may be detected by, e.g., radioeiaission, 
fluorescent microscopy, fluorescent activated cell sorting, or a luminometer. 

As used herein with respect to antibodies, an antibody is said to "selectively bind" or 
"specifically bind" to a target if tiie antibocfy recognizes and binds the target of interest but does not 
substantially recognize and bind other molecules m a sample, e.g., a biological sample, which 
includes the target of interest. 
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As used herein, the term "vector" refers to a nucleic acid molecule enable of transporting 
another nucleic acid to vAddi it has been linked. One type of vector is a "plasmid", \*idi refers to a 
circular double stranded DNA loop into which additional DNA segments can be ligated. Another 
type of vector is a viral vector, \s^ereia additional DNA segments can be ligated into the viral 
S genome. Certain vectors are capable of autonomous replication in a host cell into which they are 
introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a 
host cell upon introduction into tiie host cell, and thereby are replicated along with the host genome. 
Moreover, certain vectors are capable of directing the expression of genes to which they are 

10 operatively linked. Such vectors are referred to herein as "expression vectors". In general, e?q)iession 
vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the present 
specification, "plasmid" and "vector" can be used interchangeably as the plasmid is the most 
commonly used form of vector. However, the invention is intended to include such other forms of 
e3q)ression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and 

15 adeno-associated viruses), which serve equivalent functions. 

As used herein, "efBsctive amount" describes the amount of an agent, preferably a CPP or 
CPP modulator of the invention, sufi&cient to have a desired effect. For example, an 
anticardiovascular disorder efifective amount is the amount of an agent required to reduce a symptom 
of a cardiovascular disorder in an individual by at least 1, 2, 5, 10, 15, or preferably 25%. The term 

20 may also describe the amount of an agent required to ameliorate a cardiovascular disorder-caused 
synqjtom in an individual. Common synq)toms of cardiovascular disorders include: chest pressure, 
heartburn, nausea, vomiting, numbness, shortness of breath, heavy cold sweating, xmexplained 
fetigue, and feelings of anxiety. The more severe synq)toms of cardiovascular disorders are chest 
pain (angina pectoris), rhythm disturbances (arrhythmias), stroke, or heart attack. The effective 

25 amount for a particular patient may vary depending on such fectors as the diagnostic method of the 
syn^tom bemg measured, the state of the condition being treated, the overall health of the patient, 
method of administration, and the severity of side-effects. 

CFPs of the invention 

30 The Cardiovascular disorder Plasma Polypeptides (CPPs) of the invention are described in 

the sequence listing as SEQ ID NOs:3-4. CPPs having an amino acid sequence selected fi-om the 
group consisting of SEQ ID NOs:3-5 are secreted and circulate in blood plasma. Preferably, such 
CPPs also comprise additional amino adds from the full-length human protem of SEQ ID NO:l or 
2. Such additional aimno acids are fused in frame with the selected sequenceto fonn contiguous 

35 fragment ofthefuU-lengthhmnan protein.. 
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Interestingly, levels of the CPPs of tiie inventioa are decreased and undetectable in the 
plasma of individuals suffering jfrom cardiovascular disorders. As such, the CPPs of the invention 
provide a useful diagnostic tool, v^erein a decreased level of a CPP indicates an increased risk of 
developing, or the presence o^ a cardiovascular disorder. Further, CPPs are useful for drug design 
and in therapeutic strategies for prevention and treatxamt of cardiovascular disorders. 

The sequence from SEQ ID NO: 1 is that of the protein Chitotriosidase, from the family of 
chitinases. Chitinases catalyze the hydrolysis of the beta-l,4-N-acetyl-D-gIucosamine linkages in 
chitin polymers. They have been found ia a wide variety of non-vertebrate species. A human 
homolog of chitinase, chitotriosidase, has been identified. Chitotriosidase is a member of 
glycosylhydrolase femily 18, characterized by a conserved glutamate involved in the catalyzation 
reaction (Boot RG, et al., JBC, (1998); 273: 25680-25685). It is primarily expressed by activated 
macrophages. 

Chitotriosidase is predominantly secreted, but in part processed into a 39-kDa form that 
accumulates in lysosomes. The C-terminal extension of the 50-kDa chitotriosidase contains sialic- 
add containing O-linked glycans. It was observed that, in macrophages, alternative splicmg 
generates a distinct chitotriosidase mRNA species, encodiog a40-kDa chitotriosidase that is C- 
termmally truncated. This enzyme is ahnost identical to the 39-kDa diitotriosidase formed from the 
50-kDaprecursor by proteolytic processing. The C-terminus present in the 50-kDa chitotriosidase, 
but absent in tiie 39- kDa isofonn, mediates tight binding to chitin. However, 39-kDa chitotriosidase. 
is still active against soluble substrates. In the bloodstream, the secretory 50-kDa chitotriosidase 
occurs predonunantty, while the 39-kDa form is abundant in tissues . 

Chitotriosidase secretion from the abnormal macrophages of Gaucher patients is markedly 
increased (Giraldo P, et al., Haematologica, Sep (2001); 86(9): 977-984). Gaucher disease is caused 
by an autosomal recessive inherited deficiency in the lysosomal hydrolase beta-glucocerebrosidase 
that results m accumulation of the lipid in lysosomes of tissue macrophages. The enzymatic defect 
leaxis to the accumulation of the lipid glucocerebroside in liver, spleen and bone marrow. In some 
other inherited lysosomal storage disorders, especially sphingolipidoses such as Niemann Pick, GMl- 
gangliosidosis, and Krabbe disease, which involve accumulation of different lipids, more modest 
elevations in plasma chitotriosidase have been noted. Ia addition, macrophages present in 
atherosclerotic plaques express hi^er levels of the enzyme. These observations have lead to 
speculation that chitotriosidase is involved in tissue remodeling. 

We describe here the "Cardiovascular disorder Plasma Polypeptides" of the invention that 
circulate in normal human plasma. The polypeptide species of the inv^on represent an iraportasst 
and accurate diagnostic tool for detenmning the risk of coronary artery disease (CAD), coronary 
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heart disease (CHD), peripheral vascular disease, cerebral ischemia (stroke), coi^estive heart 

feihire, atherosclerosis, hypertension, and other cardiovascular diseases. The polypeptides of Ihe 

invention are small, secreted &ctors and as such, are ideal candidates for protein-based therapies. 

For dosage modulation in a dinical setting, protein therapy is preferable to genetic therapy, which is 

hampered by Ihe lack of finely regulable e3q>ression. Furflier, as secreted factors, the polypqjtide 

species of the invention are easy to target, e.g., with a small molecule or protein inhibitor. Thus, the 

polypeptide species of the invention are useful for drug development and design of therapeutic 

strategies to prevent and treat cardiovascular disease. 

The polypeptide species of the invention contain the putative active site of chitotriosidase 
and are described as SEQ ED NOs:3-5. 

The terms "Cardiovascular disorder Plasma Polypeptide" and "CPP" are used herein to 
embrace any and all of tifie peptides, polypeptides and proteins of the present invention. Also 
forming part of the invention are polypeptides encoded by the polynucleotides of the invention, as 
well as fusion polypeptides comprising such polypeptides. The invention embodies CPPs from 
humans, including isolated or purified CPPs consisting of, consisting essentially oi^ or conq)rising an 
amino acid sequence selected firom the group consisting of SEQ ID NOs:3-5. Further included are 
CPP proteolytic precursors and intermediates of the sequence selected firom the group consisting of 
SEQIDN0s:l-2. 

The present invention embodies isolated, purified, and recombinant polypeptides comprising 
a contiguous span of at least 3 amino acids, preferably at least 8 to 10 amino acids, with a CPP 
biological activity, hi preferred embodiments the contiguous stretch of amino adds comprises the 
site of a mutadon or functional mutation, including a deletion, addition, swap or truncation of the 
amino adds in the CPP sequence. The invention also concerns ttie polypqptide encoded by the CPP 
nudeotide sequences of the mvention, or a complementary sequence thereof or a firagment thereof 

One aspect of the invention pertains to isolated CPPs, and biologically active portions thereof, 
as well as polypeptide firagments suitable for use as immunogens to raise anti-CPP antibodies. In one 
embodiment, native CPP peptides can be isolated fi-om plasma, ceUs or tissue sources by an 
appropriate purification scheme using standard protein purification techniques. In another 
embodiment, CPPs are produced by recombinant DNA techniques. Alternative to recombinant 
expression, a CPP can be synthesized chemically using peptide synthesis techniques, as described in 
the section titled "Chemical Manufecture of CPP conq)ositions" and in Example 3 . 

Typically, biologically active portions comprise a domain or motif vsdth at least one activity of 
a CPP. A biologically active CPP may, for example, comprise at least 1, 2, 3, or 5 ammo acid 
changes from the sequence selected from the group consisting of SEQ ID NOs:3-5, or comprise at 
least 1%, 2%, 3%, 5%, 8%, 10% or 15% change in amino adds fix>m the sequence selected firom the 
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group consisting of SEQ ID NOs:3-5. 

Characterization of CPPs 

The polypeptides of the invention, CPPs, are defined by the tryptic peptide of SEQ ID NO: 
5, listed in Table 1. Ibis peptide was identified by Mass Spectrometry analysis of a species isolated 
fi-om the plasma of control individuals without Coronary Artery Disease (CAD) and characterized 
according to the MicroProt.TM method, as described in Exan5)le 1 . This species was not detectable 
in plasma from patients affected with CAD . 

The CPPs of the invention are all less than or aroxmd 20kD in molecxilar weight, as the 
plasma sample is first separated based on molecular weight. Higher molecular weight polypeptide 
species are separated and characterized by a different method. As described in Example 1, the 
plasma sample is subjected to a number of chromatography separations. Details about these 
chromatography methods are given in Example 1 . 

The first separation is on a cation exchange chromatography column, which is eluted with 
increasing salt concentration. Eighteen firactions are collected. The CEX column in Table 1 lists 
which fiBction contained the tryptic peptide, as well as its elution conditions. Separation by cation 
exchange provides an indication of the overall positive charge of a polypeptide species. Cation 
exchange is followed by a reverse phase HPLC separation. The RPl column in Table 1 lists in which 
of the 30 fractions the tryptic peptide eluted, as well as their elution conditions. Separation by 
reverse phase provides an indication of the overall hydrophobidty of a polypeptide species The last 
* two digits of the column labeled Run Number indicate which of the 24 eluted firactions contained the 
tryptic peptide from the second reverse phase HPLC separation (see Example 1). 

The information provided in Table 1 reveals a number of characteristics of the polypeptide 
species present in plasma of individuals without CAD. For example, tiie same tryptic peptide is 
found in more than one fraction of the RPl separation. Since those RPl fractions are widely spaced, 
this indicates that the tryptic peptide results from different circulating polypeptide species. 
Therefore, the appearance in Table 1 of the same peptide sequence in fraction 16 and fraction 20 of 
the RPl separation indicates that there exists two polypeptide species from which the tryptic peptide 
was derived, and which have different overall hydrophobicities. 

The observed tryptic peptide of SEQ ID NO:5 is derived from liie polypeptides of SEQ ID 
N0s:3-4 after trypsin digestion as described in Example 1 . The polypeptides of SEQ ID NOs:3-4 are 
arrived at by removal, on the propeptide of SEQ ID NO : 1, of the predicted signal sequence and by 
successive proteolyses at dibasic sites. Given the separation according to molecular weight 
undergone by the detected species (described in Step 2 of Example 1), CPP 6 and 7 (SEQ ID N0s:3 
and 4, respectively) represent the likely plasma polypeptides from which the tryptic peptide of SEQ 
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ID N0:5 is derived aiKiwMch are observed in RPlfi^ 16 and 20 (Table 1). Assuch,CPP6 
and 7 aie especially valuable for diagnostic and liierapeutic strategies relating to cardiovascular 
disorders. 

CPP nucleic acids 

One aspect of the invmtion pertains to purified or isolated nucleic acid molecules that encode 
CPPs or biologically active portions thereof as further described herein, as well as nucleic acid 
firagments thereof Said nucleic acids may be used for exanq)le in therapeutic and diagnostic 
methods and in drug screening assays as further described herein. 

An object of the invention is a purified, isolated, or recombinant nucleic acid coding for a 
CPP, complementary sequences thereto, and fragments thereof The invention also pertains to a 
piuified or isolated nucleic acid comprisiug a polynucleotide having at least 95% nucleotide identity 
with a polynucleotide coding for a CPP, advantageously 99 % nucleotide identity, preferably 99.5% 
nucleotide identity and most preferably 99.8% nucleotide id&a&y with a polynucleotide coding for a 
CPP, or a sequence complementary thereto or a biologically active firagment thereof Another object 
of the invention relates to purified, isolated or recombinant nucleic acids comprising a polynucleotide 
that hybridizes, under the stringent hybridization conditions defined herein, with a polynucleotide 
coding for a CPP, or a sequmce complementaiy thereto or a variant thereof or a biologically active 
firagment thereof . 

In another preferred aspect, the invention pertains to purified or isolated nucleic acid 
molecules that encode a portion or variant of a CPP, whereia the portion or variant displays a CPP 
biological activity of the invention. Preferably said portion or variant is a portion or variant of a 
naturally occurring CPP or precxursor thereof. 

Another object of the invention is a purified, isolated, or recombinant nucleic acid encoding a 
CPP comprising the amino add sequence selected from the group of SEQ ID NOs:3-5, or fi^gments 
thereof, wherein the isolated nucleic acid molecule encodes one or more motifs (such as a CPP target 
binding region, a secretion signal peptide, or a dibasic cleavage site). 

The nucleotide sequence determined from the cloning of the CPP-encoding gene allows for 
the generation of probes and primers designed for use in identifying and/or cloning other CPPs (e.g. 
sharing the novel functional domains), as well as CPP homologues from other species, 

A nucleic acid fi:agment encoding a "biologically active portion of a CPP" can be prepared 
by isolating a portion of a nucleotide sequence coding for a CPP, which racodes a polypeptide 
having a CPP biological activity, e3q)ressing the encoded portion of the CPP (e.g., by recombinant 
expression in vitro or in vivo) and assessing the activity of the encoded portion of the CPP. 

The invention further encompasses nucleic acid molecules that differ fix)m the CPP 
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nucleotide sequences of fhe inveation due to degeneracy of the genetic code and encode the same 

CPPs of the invoition. 

In addition to Ihe CPP nucleotide sequences described above, it will be appreciated by those 
skilled in the art that DNA sequence polymorphisms that lead to changes in the amino acid sequences 
5 of the CPPs may exist within a population (e.g., the human population). Such genetic polymorphism 
may exist among individuals within a population due to natural allelic variation. Such natural allelic 
variations can typically result in 1-5% variance in the nucleotide sequence of a CPP-encoding gene 
or nucleic acid sequence. 

Nucleic acid molecules corresponding to natural allelic variants and homologues of the CPP 
10 nucleic acids of the invention can be isolated based on their homology to the CPP nucleic acids 
disclosed hereia using the cDNAs disclosed herein, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. 

It will be appreciated that the invention comprises polypeptides having an amiao acid 
sequence encoded by any of the polynucleotides of the invention. 

15 

Uses of CPP nucleic acids 

Polynucleotide sequences (or the complements thereof) encoding CPPs have various 
applications, including uses as hybridization probes, in chromosome and gene mapping, and in the 
generation of antisense KNfA and DNA. In addition, CPP-encoding nucleic adds are usefiil as targets 

20 for pharmaceutical intervention, e.g. for the development of DNA vaccmes or antisense 

compositions, and for the preparation of CPPs by recombinant techniques, as described herein. The 
polynucleotides described herem, includiag sequence variants thereof can be used m diagnostic 
assays. Accordingly, diagnostic methods based on detecting the presence of such polynucleotides in 
body fluids or tissue samples are a feature of the present invention. Examples of nucleic add based 

25 diagnostic assays in accordance with the present invention include, but are not limited to, 

hybridization assays, e.g., in situ hybridization, and PCR-based assays. Polynucleotides, mcluding 
extended length polynucleotides, sequence variants and fragments thereof, as described herein, may 
be used to generate hybridization probes or PCR primers for use in sudi assays. Such probes and 
primers will be capable of detecting polynucleotide sequences, including genomic sequences that are 

30 similar, or complementary to, the CPP polynucleotides described herein. • 

The invention includes primer pairs for carrying out a PCR to amplify a segment of a 
polynucleotide of the invention. Each primer of a pair is an oligonucleotide having a length of 
between 15 and 30 nucleotides such that i) one primer of the pair forms a perfectly matdied duplex 
with one strand of a polynucleotide of the invention and the other primer of the pair form a perfectiy 

35 match duplex with the complonentary strand of the same polynucleotide, and ii) the primers of a pair 
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form such perfectly matched duplexes at sites m the polynucleotide that separated by a distance of 

between 10 and 2500 nucleotides. Preferably, the annealing tenq)erature of each primer of a pair to 

its respective complementary sequence is substantially the same. 

Hybridization probes derived from polynucleotides of the invention can be used, for 

S example, in performing in situ hybridization on tissue samples, such as fixed or frozen tissue sections 
prepared on microscqpic slides or suspended cells. Briefly, a labeled DNA or KNA probe is allowed 
to bind its DNA or RNA target sample in the tissue section on a prepared microscopic, under 
controlled conditions. Generally, dsDNA probes consisting of the DNA of interest cloned into a 
plasmid or bacteriophage DNA vector are used for this purpose, although ssDNA or ssRNA probes 

10 may also be used. Probes are generally oligonucleotides between about 15 and 40 nucleotides in 
length. Alternatively, the probes can be polynucleotide probes generated by PGR random priming 
primer extension or in vitro transcription of RNA from plasmids (riboprobes). These latter probes 
are typically several hundred base pairs in length. The probes can be labeled by any of a number of 
label groups and the particular detection method will correspond to the type of label utilized on the 

15 probe (e.g., autoradiography. X-ray detection, fluorescent or visual microscopic analysis, as 

appropriate). The reaction can be further amplified in situ using intmiunocytochemical techniques 
directed against the label of the detector molecule used, such as an antibody directed to a fluorescein 
moiety present on a fluoresc^otly labeled probe. Specific labeling and in situ detection methods can 
be foun4 for example, in Howard, G. C, Ed., Methods in Nonradioactive Detection, Appleton & 

20 Lange, Norwalk, Conn., (1993), herein incorporated by reference. 

Hybridization probes and PCR primers may also be selected from the genonuc sequences 
correspondiag to the fiiU-length proteins identified in accordance with the present iavention, 
including promoter, enhancer elemmts and introns of the gene encoding the naturally occurring 
polypeptide. Nucleotide sequences encodiog a CPP can also be used to construct hybridization 

25 probes for mapping the gene encoding that CPP and for the genetic analysis of individuals. 

Individuals carrying variations of, or mutations in the gene encoding a CPP of the present invention 
may be detected at the DNA level by a variety of techniques. Nucleic adds used for diagnosis may 
be obtained from a patient's cells, including, for example, tissue biopsy and autopsy material. 
Genomic DNA may be used directly for detection or may be amplified enzymatically by using PCR 

30 (Saiki, et al. Nature 324: 163-166 (1986)) prior to analysis. RNA or cDNA may also be used for the 
same purpose. As an example, PCR primers complementary to the nucleic acid of the present 
invention can be used to identify and analyze mutations in the gene of the present invention. 
Deletions and insertions can be detected by a change in size of Ihe amplified product in con:q)arison 
to the normal genotype. Point mutations can be identified by hybridi2dng amplified DNA to 

35 radiolabeled RNA of the invention or alternatively, radiolabeled antisense DNA sequences of the 
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invention. Sequence dianges at specific locations may also be revealed by nuclease protection 
assays, such as RNase and SI protection or the chemical cleavage method (e.g. Cotton, et al., Proc, 
Natl Acad Sci, uisA 85:4397-4401 (1985)), or by diflferences in melting temperatures. '^Molecular 
beacons" (Kostiikis L. G. et al.. Science 279:1228-1229 (1998)), haiipin-shaped, single-stranded 
5 synthetic oligonucleotides containing probe sequences which are complementary to the nucleic acid 
of the present invention, may also be used to detect point mutations or other sequence changes as 
well as monitor expression levels of CPPs. 

Oligonucleotide and Antisense Compoimds 

10 Oligonucleotides of the invention, including PGR primers and antisense compounds, are 

synfiiesized by conventional means on a commercially available automated DNA synthesizer, e.g. an 
Applied.Biosystems (Foster Chy, CA) model 380B, 392 or 394 DNA/RNA synthesizer, or like 
instrument. Preferably, phosphoramidite chemistry is employed, e.g. as disclosed in the following 
references: Beaucage and Iyer, Tetrahedron, 48: 2223-23 1 1 (1992); Molko et al, U.S. patent 

15 4,980,460; Koster et al, U.S. patent 4,725,677; Carufhers et al, U.S. patents 4,415,732; 4,458,066; 
and 4,973,679; and the like. For therapeutic use, nuclease resistant backbones are preferred. Many 
types of modified oligonucleotides are available that confer nuclease resistance, e.g. 
phosphorothioate, phosphorodithioate, phosphoramidate, or the like, described in many references, 
e.g. phosphorothioates: Stec et al, U.S. patent 5,151,510; Hirschbem, U.S. patent 5,166,387; 

20 Beigot, U.S. patent 5,183,885; phosphoramidates: Froehler et al, Intemational application 

PCTAJS90/03 138; and for a review of additional applicable chemistries: Uhknann and Peyman 
(cited above). The length of the antisense oligonucleotides has to be sufiSciently large to ensure that 
specific binding will take place only at the desired target polynucleotide and not at other fortuitous 
sites. The upper range of the length is determined by several fectors, including the mconvenience 

25 and expense of synthesizing and purifying oligomers greater than about 30-40 nucleotides in length, 
the greater tolerance of longer oUgonucleotides for mismatches than shorter oligonucleotides, and the 
like. Preferably, the antisense oligonucleotides of the invention have lengths in the range of about 15 
to 40 nucleotides. More preferably, the oligonucleotide moieties have lengths in the range of about 
18 to 25 nucleotides. 

30 

Primers and probes 

Primers and probes of the invention can be prepared by any suitable method, including, fer 
^cample, cloning and restriction of appropriate sequences and direct chemical synthesis by a method 
such as the phosphodiester mediod of Narang SA et al (Methods Enzymol 1979;68:90-98), the 
35 phosphodiester method of Brown EL et al (Methods l&izymol 1979;68: 109-15 1), the 
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diethylphosphoramidite method of Beaucage et al (Tetrahedron Lett 1981, 22: 1859-1862) and die 

solid support method described in EP 0 707 S92, the disclosures of v^faich are incoxporated herein by 

reference in their entireties. 

Detection probes are generally nucleic add sequences or uncharged nucleic acid analogs 

S such as, for example peptide nucleic acids whidi are disdosed in International Patent Application 

WO 92/20702, morpholino analogs which are described in U.S. Patrats Numbered 5,185,444; 

5,034,506 and 5,142,047. If desired, the probe may be rendered '*non-extendable" in that additional 

dNTPs cannot be added to the probe. In and of themselves analogs usually are non-extendable and 

nucleic acid probes can be rendered non-extendable by modifying the 3* end of the probe such that 

10 the hydroxyl group is no longer capable of participating in elongation. For example, the 3* end of the 

probe can be fimctionalized with the capture or detection label to thereby consume or otherwise 

blodc the hydroxyl group. 

Any of the polynucleotides of the present invention can be labeled, if desired, by incorporating 

any label group known in the art to be detectable by spectroscopic, photochemical, biochemical, 

15 immunodiemical, or chemical means. Additional examples include non-radioactive labeling of 

nucleic acid fragments as described in Urdea et al. (Nudeic Adds Research. 1 1:4937-4957, 1988) or 

Sanchez-Pescador et al. (J. Chn. Microbiol. 26(10):1934-1938, 1988). hi addition, the probes 

accordiiig to the present invention may have structural characteristics such that they allow the signal 

amplification, sudi structural diaracteristics being, for exanq)le, branched DNA probes as those 

20 described by Urdea et al (Nucldc Adds Symp. Ser. 24: 197-200, 1991) or in the European patent 

No. EP 0225807 (Chiron). 

A label can also be used to capture the primer, so as to fiicihtate the immobilization of either 

the primer or a primer extension prod u ct such as amplified DNA, on a soUd support. A capt ure 

label is attached to the primers or probes and can be a specific binding member yAddk forms a 

25 binding pair with the soUd's phase reagent's specific binding member (e.g. biotin and streptavidin). 

Therefore depending upon the type of label carried by a polynucleotide or a probe, it may be 

employed to capture or to detect the target DNA. Further, it will be imderstood that the 

polynucleotides, primers or probes provided herein, may, themselves, serve as the capture label. For 

example, in the case where a soUd phase reagent's binding member is a nucleic acid sequence, it may 

30 be selected such that it binds a complementary portion of a primer or probe to thereby immobilize the 

primer or probe to the soKd phase. In cases where a polynucleotide probe itself serves as the binding 

member, those skilled in the art will recognize that the probe wiU contain a sequence or *^tail" that is 

not complementary to the target. In the case where a polynucleotide primer itself serves as the 

capture label, at least a portion of the primer wiU be fi:ee to hybridize with a nucleic add on a solid 

35 phase. DNA labeliiigtedmiques are wdl known to the skilled technician. 
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The probes of tiie present mvention are useful for a number of purposes. They can be notably 
used in Southern hybridizatioti to genomic DNA. The probes can also be used to detect PGR 
anq)Iification products. They may also be used to detect mismatches in CPP-encoding genes or 
mRNA usiiig other techniques. 

Any of the nucleic acids, polynucleotides, primers and probes of the present invention can be 
conveniently umnobilized on a solid support. Solid supports are known to those skilled in the art and 
include the walls of wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, 
nitrocellulose strips, membranes, microparticles such as latex particles, sheep (or other animal) red 
blood cells, duracytes and others. The solid support is not critical and can be selected by one skilled 
in the art. Thus, latex particles, microparticles, magnetic or non-magnetic beads, membranes, plastic 
tubes, walls of microtiter wells, glass or silicon chips, sheep (or other suitable animal's) red blood 
cells and duracytes are all suitable exanq)les. Suitable methods for immobilizing nucleic acids on 
solid phases include ionic, hydrophobic, covalent interactions and the like. A solid support, as used 
herein, refers to any material which is insoluble, or can be made insoluble by a subsequent reaction. 
The solid support can be chosen for its intrinsic ability to attract and immobilize the capture rea^nt. 
Alternatively, the solid phase can retain an additional receptor which has the ability to attract and 
immobilize the capture reagent. The additional receptor can include a charged substance that is 
oppositely charged with respect to the c^rture reagent itself or to a charged substance conjugated to 
the capture reagent. As yet another alternative, the recq)tor molecuk can be any specific binding 
member attached to Ifae solid support and which has the ability to immobilize tiie capture reagent 
Ihrougji a specific binding reaction. The receptor molecule enables the indirect binding of Ae 
capture reagent to a solid support material before the performance of the assay or during the 
performance of the assay. The solid phase thus can be a plastic, derivatized plastic, magnetic or 
non-magnetic metal, gjass or silicon sur&ce of a test tube, microtiter well, sheet, bead, microparticle, 
chip, sheep (or other suitable animal's) red blood cells, duracytes and other configurations known to 
those of ordinary skill in the art. The nucleic acids, polynucleotides, primers and probes of the 
invention can be attached to or immobilized on a solid support individually or in groups of at least 2, 
5, 8, 10, 12, 15, 20, or 25 distinct polynucleotides of the invention to a single solid support. In 
addition, polynucleotides other than those of the invention may be attached to the same solid support 
as one or more polynucleotides of the invention. 

Any polynucleotide provided herein may be attached in overlapping areas or at random 
locations on a soUd support. Alternatively the polynucleotides of the invention may be attached in an 
ordered array wherein each polynucleotide is attached to a distinct region of the solid support which 
does not overlap with the attachment site ofwy other polynucleotide. Preferably, such an ordered 
array of polynucleotides is designed to be "addressable" v/b&rc tiie distinct locations are recorded and 
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can be accessed as part of an assay procedure. Addressable polynucleotide arrays typically 
comprise a plnralify of dififerent oligonucleotide probes that are coupled to a sur&ce of a substrate in 
different known locations. The knowledge of the precis^ location of each polynucleotides location 
makes these "addressable" arrays particularly useful in hybridization assays. Any addressable array 
technology known in the art can be employed with tiie polynucleotides of the invention. One 
particular embodiment of these polynucleotide arrays is known as the Gtaiechips, and has been 
generally described in US Patent 5,143,854; PCT pubUcations WO 90/15070 and 92/10092, tiie 
disclosures of which are incorporated herein by reference in their entireties. 

Methods for obtaining variant nucleic acids and polypeptides 

In addition to naturally-occurring alleUc variants of the CPP sequences tiiat may exist in the 
population, the skilled artisan will appreciate that changes can be introduced by mutation into the 
nucleotide sequences coding for CPPs, thereby leading to changes in the amino acid sequence of the 
encoded CPPs, wiOi or witiiout altering the functional abiUty of the CPPs. 

Several types of variants are contemplated including 1) one in which one or more of the 
amino acid residues are substituted with a conserved or non-conserved amino acid residue and such 
substituted amino acid residue may or may not be one encoded by the genetic code, or 2) one in 
which one or more of the amino acid residues includes a substituent group, or 3) one in which the 
mutated CPP is fused with another compound, such as a compound to increase the half-life of the 
polypeptide (for example, polyethylene glycol), or 4) one in vdiich the additional amino adds are 
fused to the CPP, such as a leader, a signal or anchor sequence, a sequence which is employed for 
purification of the CPP, or sequence fi^om a precursor protein. Such variants are deemed to be within 
the scope of those skilled in Ihe art. 

For raample, nucleotide substitutions leadiag to amino acid substitutions can be made in the 
sequences that do not substantially change the biological activity of the protein. An amino acid 
residue-can be altered from the wild-type sequence encoding a CPP, or a biologically active fragment 
or homologue thereof without altering the biological activity. In general, amino acid residues that are 
shared among the CPPs of the present invention are predicted to be less amenable to alteration. 

In another aspect, the invention pertains to nucleic acid molecules encoding CPPs that 
contain changes in amino acid residues that result in increased biological activity, or a modified 
biological activity. In another aspect, the invention pertains to nucleic acid molecules encoding 
CPPs that contain changes in amino acid residues that are essential for a CPP biological activity. 
Such CPPs differ in amino acid sequence from SEQ ID NOs:3-4 and display reduced activity, or 
essentially lack one or more CPP biological activities. 

Mutations, substitutions, additions, or deletions can be introduced into any of SEQ ID 
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NOs: 1-5, by standard techniques, sudi as site-directed mutagenesis and PCR-mediated nmtagenesis. 
For example, conservative anuno acid substitutions may be made at one or more predicted nan- 
essential amino acid residues. A "conservative amino add substitution" is one in which the amino 
acid residue is replaced with an amino add residue having a similar side diain. Families of amino 
5 acid residues having similar side chains have been defined in the art Hiese £amilies include amino 
acids with basic side chains (e.g., lysine, argmine, histidine), acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, 
tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) 

10 and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted 
nonessential amino acid residue in a CPP, or a biologically active firagment or homologue thereof 
may be replaced with another amino acid residue from the same side chain femily. Alternatively, in 
another embodiment, mutations can be introduced randomly along all or part of a CPP coding 
sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for CPP • 

15 biological activity to identify mutants that retain activity. Following mutagenesis of the nucleotide 
encoding one of SEQ ED NOs: 1-5, the encoded protein can be expressed recombinantly and the 
activity of the protein can be determined in scay suitable assay, for example, as provided herein. 

The invention also provides CPP diimeric or fusion proteins. As used herein, a CPP 
"chimeric protein" or "fiision protein" conq;>rises a CPP of the invention or jfragment thereof 

20 operativdy linked or fused in fiiame to a non-CPP polypeptide sequence. In a preferred embodiment, 
a CPP fusion protein comprises at least one biologically active portion of a CPP. In another 
preferred embodiment, a CPP fusion protdn comprises at least two biologically active portions of a 
CPP. For example, in one embodiment, the fusion protein is a GST-CPP fiision protein in whidi 
CPP domain sequences are fiised to the C-tenninus of the GST sequences. Sudi fusion proteins can 

25 fiidlitate Ihe pinification of recombinant CPPs. In another embodiment, the fiision protein is a CPP 
containing a heterologous signal sequence at its N-terminus, for example, to allow fi)r a desired 
ceUular localization in a certain host cell. In yet another embodiment, the fusion is a CPP 
biologically active firagment and an immunoglobulin molecule. Such fusion proteins are usefiil, for 
example, to increase the valency of CPP binding sites. For example, a bivalent CPP binding site 

30 may be formed by fusing biologically active CPP firagments to an IgG Fc protein. 

The CPP fusion proteins of the invention can be incorporated into pharmaceutical 
conq)ositions and administered to a subject in vivo. Moreover, the CPP fiision proteins of the 
invention can be used as immunogens to produce anti-CPP antibodies in a subject, to purify CPP or 
CPP ligands and in screening assays to identify molecules which modulate the interaction of CPP 

35 with a CPP target molecule. 
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Furthennore, isolated fragments of CPPs can also be obtained by screening peptides 

recombinandy produced from the corresponding fragment of the nucleic acid encoding such peptides. 

In addition, fragments can be chemically synthesized using techniques knoi?m in the art such as 

conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, a CP? of the present 

invention may be arbitrarily divided into fragments of desired lerigth with no overlap of the 

fragments, or preferably divided into overlapping fragments of a desired length. The fragments can 

be produced (recombinantly or by chemical synthesis) and tested to identify those peptidyl fragments 

with a CPP biological activity, for example, by microinjection assays or in vitro protein binding 

assays. In an illustrative embodiment, peptidyl portions of a CPP, such as a CPP target binding 

region, can be tested for CPP activity by expression as thioredoxin fusion proteins, each of which 

contains a discrete fragment of the CPP (see, for example, U.S. Patents 5, 270,181 and 5,292,646; 

and PCX publication W094/02502, the disclosures of which are incorporated herein by reference). 

In addition, hbraries of fragments of a CPP coding sequence can be used to generate a 

variegated population of CPP fragments for screening and subsequent selection of variants of a CPP. 

In one embodiment, a Ubrary of coding sequence fragments can be generated by treating a double 

stranded PCR fragment of CPP coding sequence with a xmclease under conditions wherein nicking 

occiurs only about once per molecule, dsiaturing the double stranded DNA, renaturing the DNA to 

form double stranded DNA whidi can include sense/antisense pairs from different nicked products, 

removing single stranded portions from reformed duplexes by treatment with SI nuclease, and 

Ugating the resulting fragment library into an e7q)ression vector. By this method, an e?q>ression 

library can be derived which encodes N-tenninal, C-terminal and internal fragments of various sizes 

of the CPP. 

Modified CPPs can be used for such purposes as enhancing therapeutic or prophylactic 
e£5cacy, or stability (e.g., ex vivo shelf life and resistance to proteolj^c degradation in vivo). Such 
modified peptides, when designed to retain at least one activity of the naturally occurring form of the 
protein, are considered fimctional equivalents of the CPP described in more detail herein. Such 
modified peptide can be produced, for instance, by amino acid substitution, deletion, or addition. 

Whether a change in the amino acid sequence of a peptide results in a fimctional CPP 
homolog can be readily determined by assessmg at least one CPP biological activity of the variant 
peptide. Peptides in which more than one replacement has taken place can readily be tested in the 
same manner. 

This invention finlher contemplates a method of generating sets of combinatorial mutants of 
the presently disclosed CPPs, as well as truncation and fragmentation mutants, and is especially 
usefiil for identifying potential variant sequences which are fimctional in binding to a CPP target 
protein but differ from a wild-type form of the protein by, for example, e£Bca(^, potency and/or 
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intracelliilar half-life. One purpose for screening such combinatorial libraries is, for exaixiple, to 
isolate novel CPP homologs with altered biological activity vfhsa cosnpared with the wild-type 
protein, or alternatively, possessing novel activities all together. For example, mutagenesis can give 
rise to CPP homologs i?^ch have intracellular half-lives dramatically different ibsn the 
5 corresponding wild4ype protein. The altered protein can be rendered either more stable or less stable 
to proteolytic degradation, or cellular processes which result in destruction ot or otherwise 
inactivation o^ a CPP. Such CPP homologs, and the genes which encode them, can be utilized to 
alter the envelope of expression for a particular recombinant CPP by modulating the half-life of the 
recombinant protein. For instance, a short half-life can give rise to more transient biological effects 
10 associated with a particular recombinant CPP and, when part of an inducible expression system, can 
allow tighter control of recombinant protein levels within a cell and in circulating plasma. As above, 
such proteins, and particularly their recombinant nucleic acid constracts, can be used in gene therapy 
protocols. 

In an illustrative embodiment of this method, the amino acid sequences for a population of 

15 CPP homologs or other related proteins are aligned, preferably to promote the highest homology 
possible. Such a population of variants can include, for example, CPP homologs from one or more 
species, or CPP homologs from the same species but which differ due to mutation. Amino acids 
v^di appear at each position of the aligned sequences are selected to create a degenerate set of 
combinatorial sequences. There are many ways by which the library of potential CPP homologs can 

20 be generated from a degenerate oligonucleotide sequence. Ch^nical synthesis of a degenerate gene 
sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes then be Ugated 
into an appropriate gene for expression. The purpose of a degenerate set of genes is to provide, in 
one mixture, all of the sequences encoding the desired set of potential CPP sequences. The synthesis 
of degenerate oligonucleotides is well known in the art (see for exanq)le. Narang, SA (1983) 

25 Tetrahedron 393; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. 

Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp. 273-289; Itakura et al. (1984) Annu. 
Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 
1 1:477. Such techniques have been employed in the directed evolution of other proteins (see, for 
exanq)le, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; 

30 Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as weU as 
U.S. Patents Nos: 5, 223,409, 5,198,346, and 5,096,815). The disclosures of the above references 
are incorporated herein by reference in their entireties. 

Alternatively, other forms of mutagenesis can be utilized to generate a combinatorial Ubrary, 
particularly where no other naturally occurring homologs have yet been sequenced. For example, 

35 CPP homologs (both agonist and antagonist forms) can be generated and isolated from a library by 
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screening using, for example, alanine scaiming mutagenesis and the like (Ruf et al. (1994) 
Biochemistry 33:1565-1572; Wang et al. (1994) J Biol. CSiem, 269:3095-3099; Balint et al. (1993) 
Gene 137:109-1 18; Grodberg et al. (1993) Eur. J Biochem. 218:597-601; Nagashima et al. (1993) J 
Biol. Chem. 268:2888-2892; Lowman et al. (1991) Biochemistry 30:10832-10838; and Cunningham 
et al. (1989) Science 244:1081-1085), by linker scanning mutagenesis (Gustin et al. (1993) Virology 
193:653-660; Brown et al. (1992) Mol. Cell Biol. 12:2644 2652; McKnight et al. (1982) Science 
232:316); by saturation mutagenesis (Meyers et al. (1986) Science 232:613); by PCR mutagenesis 
(Leung et al. (1989) Method Cell Mol Biol 1: 1-19); or by random mutagenesis (Miller et al. (1992) 
A Short Course in Bacterial Genetics, CSHL Press, Cold Spring Harbor, NY; and Greener et al. 
(1994) Strategies in Mol Biol 7:32-34, the disclosures of which are incorporated herein by reference 
in their entireties). 

A further method exploits automatic protein design to generate protein libraries for screening 
and optimization of the sequence of a protein of the invention. See, for example, U.S. Patent 
6403312, disclosure of which is incorporated herein by reference. Briefly, a primary Ubrary is 
generated using conq)utational processing based on the sequence and structural characteristics of the 
CPP. Generally speaking, the goal of the computational processing is to determine a set of optimized 
protein sequences that result in the lowest energy conformation of any possible sequence. However, a 
plurality of sequences that are not the global minimum may have low energies and be useful. Thus, 
a primary library comprising a rank ordered list of sequences, generally in terms of theoretical 
quantitative stabiUty, is generated. THbese sequences may be used to synthesize or express peptides 
displaying an extended half-life or stabiUzed interactions with CPP bmding compounds and proteins. 

A wide range of techniques are known in the art fer screening gene products of 
combinatorial libraries made by pomt mutations, as well as for screening cDNA libraries for gene 
products having a certain property. Such techniques will be generally adaptable for rapid screening 
of the gene libraries generated by the combinatorial mutagenesis of CPPs. The most widely used 
techniques for screening large gene libraries typically comprises cloning the gene library into 
replicable expression vectors, transforming appropriate cells with the resulting Ubrary of vectors, 
and expressing the combinatorial genes under conditions in which detection of a desired activity 
feicihtates relatively easy isolation of the vector encoding the gene whose product was detected. 

Eadi of the illustrative assays described below are amenable to high throughput analysis as 
necessary to screen large numbers of degraerate CPP sequCTces created by combinatorial 
mutagenesis techniques. In one screening assay, the candidate gene products are displayed on the . 
surfece of a cell or viral particle, and the ability of particular cells or viral particles to bind a CPP 
target molecule (for example a modified peptide substrate) via this gene product is detected in a 
"panning assay". For instance, the gene library can be cloned into the gene for a surfece membrane 
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protein of a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO 
88/06630; Fuchs et al. (1991) BioTechnology 9:1370-1371, and Goward et al. (1992) TffiS 18:136 
140). In a similar feshion, fluorescently labeled CPP target can be used to score for potentially 
functional CPP homologs. Cells can be visually inspected and separated under a fluorescence 
5 naicroscope, or, where the morphology of the cell permits, separated by a fluorescence- activated cell 
sorter. 

In an alternate embodiment, the gene library is expressed as a fusion protein on the surfece 
of a viral particle. For instance, in the filamentous phage system, foreign peptide sequences can be 
expressed on the surfece of infectious phages, thereby conferring two significant benefits. First, since 

10 these phages can be applied to affinity matrices at very high concentrations, a large number of phage 
can be screened at one time. Second, since each infectious phage displays the combinatorial gene 
product on its surface, if a particular phage is recovered firom an afiBnity matrix in low yield, the 
phage can be amplified by another round of infection. The group of almost identical E. coli 
filamentous phages M13, fd, and fl are most often used in phage display libraries, as either of the 

15 phage glU or gVUI coat proteins can be \ised to gmerate fiision proteins without disrapting the 

ultimate packaging of the viral particle (Ladner et al. PCT publication WO 90/02909; Garrard et al., 
PCX pubUcation WO 92/09690; Marks et al. (1992) J Biol. Chem. 267:16007-16010; Griffiths et al. 
(1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) 
PNAS 89:4457 4461, the disclosures of which are incorporated herein by reference in their 

20 entireties). In an illukrative mibodimen^ the recombinant phage antibody system (RPAS, Phaxmada 
Catalog number 27-9400-01) can be easily modified for use in expressing CPP combinatoiial 
libraries, and the CPP phage libraiy can be panned on immobilized CPP target molecule (glutathione 
immobilized CPP target-GST fiision proteins or immobilized DNA). Successive rounds of phage 
amplification and panning can greatly emich for CPP homologs vAdch retain an ability to bind a 

25 CPP target and which can subsequently be screened fiulher for biological activities in automated 
assays, in order to distinguish between agonists and antagonists. 

The invention also provides for identification and reduction to fimctional minimal size of the 
CPP fimctional domains, to generate mimetics, e.g. peptide or non-peptide agents, which are able to 
dismpt binding of a polypeptide of the present invention with a CPP target molecule. Thus, such 

30 mutagenic techniques a^ described above are also usefiil to map the determinants of CPPs 

participating in protein-protein interactions involved in, for example, binding to a CPP target protein. 
To illustrate, the critical residues of a CPP involved in molecular recognition of the CPP target can 
be determined and used to generate CPP target-13P-derived peptidomimetics that competitively 
mhibit binding of the CPP to the CPP target. For instance, non hydrolysable peptide analogs of such 

35 residues can be generated usmg retro-inverse peptides (e.g., see U.S. Patents 5,1 16,947 and 
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5,219,089; and Pallai et al. (1983) Int J Pept Protein Res 21:84-92), benzodiazepine (e.g., see 
Freidinger et al. in P^tides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), azspine (e.g., see HuflBtnan et al. in Peptides. Chemistry and Biology, G.R- 
Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gamma lactam rings 
5 (Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and 
Ewenson et al. in Peptides: Structure and Function (Proceedings of the 9th American Peptide 
Symposium) Pierce Chemical Co. Rockland, XL, 1985), P-tum dipeptide cores (Nagai et al. (1985) 
Tetrahedron Left 26:647; and Sato et al. (1986) J Chem Soc Perkin Trans 1: 123 1), and P- 
10 aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Commun 126:419; and Dann et al. 

(1986) Biochem Biophys Res Commun 134:71, the disclosures of which are incorporated herein by 

reference in their entireties). 

Chemical Manufacture of CPP Compositions 

15 Peptides of the invention are synthesized by standard techniques (e.g. Stewart and Young, 

Sohd Phase Peptide Synthesis, 2nd Ed., Pierce Chemical Con^any, Rockford, IL, 1984). 
Preferably, a commercial peptide synthesizer is used, e.g. Applied Biosystems, Inc. (Foster City, 
CA) model 430A, and polypeptides of the invention may be assembled from multiple, separately 
synthesized and purified, peptide in a conveigent synthesis approach, e.g. Kent et al, U.S. patent 

20 6,184,344 and Dawson and Kent, Annu. Rev, Biochem., 69: 923-960 (2000). Peptides of the 

invention may be assembled by soUd phase synthesis on a cross-linked polystyrene support starting 
from the carboxyl terminal residue and adding amino acids in a stepwise feshion until the entire 
peptide has been formed. The following references are guides to the chemistry employed during 
synthesis: Schnolzer et al. Int. J. Peptide Protein Res., 40: 180-193 (1992); Merrifield, J. Amer. 

25 Chem. Soc, Vol. 85, pg. 2149 (1963); Kent et al., pg 185, in Peptides 1984, Ragnarsson, Ed. 

(Ahnquist and WekseU, Stockhohn, 1984); Kent et al., pg. 217 in Peptide Chemistry 84, Izumiya, 
Ed. (Protein Research Foundation, B.H. Osaka, 1985); Merrifield, Science, Vol, 232, pgs. 341-347 
(1986); Kent, Ann. Rev. Biochem, Vol. 57, pgs. 957-989 (1988), and references cited in these latter 
two references. 

30 , Preferably, chemical synthesis of polypeptides of the invention is carried out by the assembly 
of peptide fragments by native chemical Ugation, as described by Dawson et al. Science, 266: 776- 
779 (1994) and Kent el al, U.S. patent 6, 184,344. Briefly, in the approach a jfirst peptide fiagment 
is provided with anN-terminal cysteine having an unoxidized sulfibydryl side chain, and a second 
peptide fiagment is provided with a C-tenninal thioester. The unoxidized sulfhydryl side chain of the 

35 N-terminal cysteine is then condensed with tiie C-terminal thioester to produce an intennediate 
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peptide fragment vUdch links fhe first and second peptide fiiigments with a p-aniinothioester bond. 
The p-aminotiiioester bond of the intennediate peptide fiagment then undergoes an intramolecular 
rearrangement to produce the peptide firagment product which links the first and second peptide 
fi-agments with an amide bond. Preferably, the N-tOTninal cysteine of the mtemal firagmetxts is 
5 protected firom undesired cyclization and/or concatenation reactions by a cyclic thiazolidine 
protecting group as described below. Preferably, such cyclic thiazolidiae protecting group is a 
thioprolinyl group. 

Peptide fragments having a C-terminal thioester may be produced as described in the 
following references, which are incorporated by reference: Kent et al, U.S. patent 6,184,344; Tam et 
10 al, Proc. Nati. Acad. Sci., 92: 12485-12489 (1995); Blake, Int. J. Peptide Protein Res., 17: 273 
(1981); Canne et al. Tetrahedron Letters, 36: 1217-1220 (1995); Hackeng et al, Proc. Nati. Acad. 
Sci., 94: 7845-7850 (1997); or Hackeng et al, Proc. Nati. Acad. Sci., 96: 10068-10073 (1999). 
Preferably, the method described by Hackeng et al (1999) is employed. Briefly, peptide fragments 
are synthesized on a solid phase support (described below) typically on a 0.25 mmol scale by using 
15 the in situ neutralization/EIBTU activation procedure for Boc chemistry disclosed by Schnolzer et al, 
fat. J. Peptide Protein Res., 40: 180-193 (1992), which reference is incorporated herein by reference. 
(HBTU is 2-(lH-behzotriazol-l-yl)-l,l,3,3-tetramethyluroiiium hexafluorophosphate and Boc is 
tert-butoxycarbonyl). Each synthetic cycle consists of N*-Boc removal by a 1- to 2- miimte 
treatment witii neat TFA, a 1-minute DMF flow wash, a 10- to 20-minute coupling time with 1.0 
20 mmol of preactivated Boc-amino acid in the presence' of DIEA, and a second DMF flow wash. 
(TFA is trifluoroacetic acid, DMF is N,N-dimefhylfonnamide, and DIEA is N,N- 
diisopropylethylamine). N*-Boc-aniino acids (1.1 nnnol) are preactivated for 3 nainutes with 1.0 • 
mmol of HBTU (0.5 M in DMF) in the presence of excess DIEA (3 mmol). After each coupling 
step, yields are determined by measuring residual free amine with a conventional quantitative 
25 ninhydrin assay, e.g. as disclosed in Sarin et al. Anal. Biochem., 1 17: 147-157 (1981). After 

coupling of Ghi residues, a DCM flow wash is used before and after deprotection by using TFA, to 
prevent possible high-temperature (TFA/DMF)-catalyzed pyrrolidone formation. After chain 
assembly is conn5)leted, the peptide firagments are deprotected and cleaved from the resin by treatment 
with anhydrous HF for 1 hour at O^^C with 4% /Msresol as a scavenger. The imidazole side-chain 
30 2,4-dinitrophenyl (dnp) protecting groups remain on the His residues because the dnp-removal 

procedure is inconopatible with C-tenninal tiiioester groups. However, dnp is gradually removed by 
. thiols during the ligation reaction. After cleavage, pq;>tide fi:agnaients are precipitated with i<^co^^ 
diethylefher, dissolved in aqueous acetonitrile, and lyophilized. 

Thioester peptide firagments described above are preferably synthesized on a trityl-associated 
35 mercaptopropionic acid-leudne (TAMPAL) resin, made as disclosed by Hackeng et al (1999), or 
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comparable protocol. Briefly, N*-Boc-Leu (4 mmol) is activated with 3.6 mmol of HBTU in the 
presence of 6 mmol of DIEA and coiq)led &t 16 mimites to 2 mmol of p-xn^thylbenzhydiylaimne 
(MBHA) resin, or tiie equivalent. Next, 3 nmolof S4iitylmercaptqpr(q>ionicaddisactiA^^ 
2.7 mmol of HBTU in the presence of 6 mmol of DIEA and coupled for 16 minutes to Leu-MBHA 

5 resin. The resulting TAMPAL resin can be used as a starting resin for polypeptide-chain assembly 
after removal of the trityl protecting group with two 1-minute treatments with 3.5% 
triisopropylsilane and 2.5% H2O in TFA. The thioester bond can be formed with any desired amino 
acid by using standard in situ-neutralization peptide coupling protocols for 1 hour, as disclosed in 
Schnolzer et al (cited above). Treatment of the final peptide firagment with anhydrous HF yields the 

10 C-tenninal activated mercaptopropionic acid-Ieucine (MPAL) thioester peptide fragments. 

Preferably, thiazolidine-protected thioester peptide firagment intermediates are used in native 
chemical ligation under conditions as described by Hackeng et al (1999), or like conditions. Briefly, 
0.1 M phosphate buffer (pH 8.5) containing 6 M guanidine, 4% (vol/vol) benzyhnercaptan, and 4% 
(vol/vol) thiophenol is added to dry peptides to be ligated, to give a final peptide concentration of 1-3 

15 mM at about pH 7, lowered because of the addition of thiols and TFA firom flie lyophilized peptide. 
Preferably, liie ligation reaction is performed in a heating block at 37°C and is periodically vortexed 
to equilibrate the thiol additives. The reaction may be monitored for d^ree of completion by 
MALDI-MS or HPLC and elecixospmy ionization MS. 

After a native chemical ligation reaction is conq)leted or stopped, the N-temtiinal thiazolidine 

20 ring of the product is opened by tr ea tment with a cysteine deprotecting ag^xt, such as O- 

methylhydroxylamine (0.5 M) at pH 3.5-4.5 for 2 hours at 3T C, after which a 10-fold excess of 
Tris-(2-carboxyefliyl)-phosphine is added to the reaction mixture to completely reduce any oxidizing 
reaction constituents prior to purification of the product by conventional preparative HPLC. 
Preferably, firactions containing the ligation product are identified by electrospray MS, are pooled, 

25 and lyophilized. 

After the synthesis is completed and the final product purified, the final polypeptide product 
may be refolded by conventional techniques, e.g. Creighton, Meth. Enzymol., 107: 305-329 (1984); 
White, Meth. Enzymol., 11: 481-484 (1967); Weflaufer, Meth. Enzymol., 107: 301-304 (1984); and 
the Uke. Preferably, a final product is refolded by air oxidation by the following, or like: The 
30 reduced lyophilized product is dissolved (at about 0. 1 mg/mL) in 1 M guanidine hydrochloride (or 
like chaotropic agent) with 100 mM Tris, 10 mM methionine, at pH 8.6. After gentle overnight 
stirring, the re-folded product is isolated by reverse phase HPLC with convwtional protocols. 

Recombinant Expression Vectors and Host Cells 
35 The polynucleotide sequences described herein can be used m recombinant DNA molecules 
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that direct the e3q)ression of the conesponding polypeptides in appropriate host cells. Because of the 
degeneracy in the genetic code, other DNA sequences may encode tiie equivalent amino acid 
sequence, and may be used to clone and ejqpress the CPPs. Codons preferred by a particular host cell 
may be selected and substituted into the naturally occurring nucleotide sequences, to increase the rate 
and/or efficiency of e:q)ression. The nucleic acid (e.g., cDNA or genomic DNA) encoding the 
desired CPP may be inserted mto a replieable vector for cloning (amplification of the DNA), or for 
expression. The polypeptide can be expressed recombinantly in any of a munber of e3q)ression 
systems according to methods known in the art (Ausubel, et al., editors. Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, 1990). Appropriate host cells include yeast, 
bacteria, archebacteria, fungi, and insect and animal cells, including mammaUan cells, for example 
primary cells, including stem cells, including, but not hmited to bone marrow stem cells. More 
specifically, these include, but are not limited to, microorganisms such as bacteria transformed with 
recombinant bacteriophage, plasmid or cosmid DNA expression vectors, and yeast transformed with 
yeast ejcpression vectors. Also included, are insect cells infected with a recombinant insect virus 
(such as baculovirus), and manfiTnalian expression systems. The nucleic acid sequence to be 
e3q)ressed may be mserted into the vector by a variety of procedures. In general, DNA is inserted into 
an appropriate restriction endonuclease site using techniques known in the art. Vector components 
generally include, but are not limited to, one or more of a signal sequence, an origin of replication, 
one or more marker genes, an enhancer element, a promoter, and a transcription termination 
sequence. Construction of suitable vectors containing one or more of these conq)onents employs 
standard Ugation techniques which are known to the skilled artisan. 

The CPPs of the present invention are produced by cutturing a host cell transformed with an 
expression vector containing a nucleic acid encoding a CPP, under the appropriate conditions to 
induce or cause expression of the protein. The conditions appropriate for CPP e3q)ression will vary 
with the choice of the expression vector and the host cell, as ascertained by one skilled in the art. For 
example, the use of constitutive promoters in the expression vector may require routine optimization 
of host cell growth and proliferation, while the use of an inducible promoter requires the appropriate 
growth conditions for induction. In addition, in some embodiments, the timing of the harvest is 
important. For example, the baculoviral systems used in insect cell expression are lytic viruses, and 
thus harvest time selection can be crucial for product yield. 

A host cell strain may be chosen for its ability to modxilate the expression of the mserted 
sequences or to process the e3q>ressed protein in the desired feshion. Such modifications of the 
protein include, but are not limited to, glycosyl, acetyl, phosphate, amide, Upid, carboxyl, acyl, or 
carbohydrate groups. Post-translational processing, which cleaves a '^repro" form of the protein, 
may also be important for correct insertion, foldmg and/or fimction. By way of example, host cells 

35 



wo 2004/061096 PCT/EP2004/000027 
such as CHO, HfeLa, EHK, MDCK, 293, W138, etc. have specific cellular machineiy and 
diaracteristic mechanisms for such post-translational activities and may be chosen to ensure the 
correct modification and processing of the introduced, foreign protein. Of particular interest are 
Drosophila melanogaster cells, Saccharomyces cerevisiae and other yeasts, E, coli, Bacillus 
subtilis, SF9 cells, C129 cells, 293 cells, Neurospora, BHK, CHO, COS, and HeLa cells, 
fibroblasts, Schwanoma cell lines, immortalized mammalian myeloid and lymphoid cell lines, Jurkat 
ceUs, human cells and other primary cells. 

The nucleic acid encoding a GPP must be "operably linked" by placing it into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or secretory 
leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates 
in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if 
it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding 
sequence if it is positioned so as to fecilitate translation. Generally, "operably linked" DNA 
sequences are contiguous, an4 in the case of a secretory leader or other polypeptide sequence, 
contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is 
accOT5)lished by Kgation at convenient restriction sites. If such sites do not exist, the synthetic 
oligonucleotide ads^tors or linkers are used in accordance with conventional practice. Promoter 
sequences encode either constitutive or inducible promoters. The promoters may be either natuially 
occurring promoters or hybrid promoters. Hybrid promoters, v^ch combine elements of more than 
one promoter, are also known in the art, and are usefid in the present invention. The esqjression 
vector may comprise additional elements, for example, the expression vector may have two 
replication systems, thus allowing it to be maintained in two organisms, for example in mammaliaTi 
or insect cells for expression and in a procaryotic host for cloning and amplification. Both expression 
and clonizig vectors contain a nucleic acid sequence that enables the vector to repUcate in one or 
more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. 
The origin of repUcation from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 
2: plasmid origm is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV 
or BPV) are useful for cloning vectors in mammalian cells. Further, for integrating e7q)ression 
vectors, the e?q}ression vector contains at least one sequence homologous to the host cell genome, and 
preferably, two homologous sequences which flank the expression construct. The integrating vector 
may be directed to a specific locus in the host cell by selecting the appropriate homologous sequence 
for inclusion in the vector. Constructs for int^rating vectors are well known in the art. In an 
additional embodiment, a heterologous expression control element may be operably linked with the 
endogenous gene in the host cell by homologous recombination (described in US Patents 6410266 
and 6361972, disclosures of which are hereby incorporated by reference in their entireties). This 
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technique allows one to regulate expression to a desired level wifh a chosen control element vAalt 
ensuring proper processing and modification of CPP endogenously expressed by the host cell. 
Useful heterologous expression control elements include but are not limited to CMV immediate early 
promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters of 
retroviral LTRs, such as those of the Rous Sarcoma Virus (RSV), and metallofhionein promotes. 

Preferably, the expression vector contains a selectable marker gene to allow the selection of 
transformed host cells. Selection genes are well known ni the art and will vary with the host cell 
used. E^qpression and cloning vectors will typically contain a selection gene, also termed a selectable 
marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other 
toxins, e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b) complement auxotrophic 
deficiencies, or (c) supply critical nutrients not available for firom complex media, e.g., the gene 
encoding D-alanine racemase for Bacilli. 

Host cells transformed with a nucleotide sequence encoding a CPP may be cultured under 
conditions suitable for tiie e3q}ressiQn and recovery of the encoded protein firom cell culture. The 
protein produced by a recombinant cell may be secreted, membrane-bound, or contained 
intracellidarly depending on the sequence and/or the vector used. As will be xmderstood by those of 
skill in the art, expression vectors containing polynucleotides encoding the CPP can be designed with 
signal sequences v^ch direct secretion of Ihe CPP through a prokaryotic or eukaryotic cell 
membrane. The desired CPP may be produced recombinantly not only directly, but also as a fiision 
polypeptide with a heterologous polypeptide, which may be a signal sequence or other polypeptide 
having a specific cleavage site at the N-terminus of the matmre protein or polypeptide. In general, the 
signal sequence may be a compouCTt of the vector, or it may be a part of the CPP-encoding DNA . 
that is inserted into the vector. The signal sequence may be a prokaryotic signal sequence selected, 
for example, firom the group of tiie alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin 
n leaders. For yeast secretion the signal sequence may be, e.g., the yeast invertase leader, alpha 
fector leader (including Saccharomyces and Kluyveromyces a-fector leaders, the latter described in 
U.S. Pat. No. 5,010,182), or acid phosphatase leader, the C albicans glucoamylase leader (EP 
362,179 published Apr. 4, 1990), or the signal described in WO 901 13646 pubUshed Nov. 15, 1990. 
In mammalian cell expression, mammaUan signal sequences may be used to direct secretion of the 
protein, such as signal sequences from secreted polypeptides of the same or related species, as well 
as viral secretory leaders. According to the e3q)ression system selected, the coding sequence is 
inserted into an appropriate vector, which in tum may require the presence of certain characteristic 
"control elements" or "regulatory sequences." Appropriate constructs are known generally in the art 
(Ausubel, et al., 1990) and, in many cases, are available firom commercial suppliers such as 
Mvitrogen (San Diego, Calif.), Stratagene (La Jolla, Calif.), Gibco BRL OElockville, Md.) or 
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Ejqn-ession in Bacterial Systems 

Transfonnation of bacterial cells may be achieved using an inducible promoter such as the 
5 hybrid lacZ promoter of the 'TBLUESCRIPT" Phagemid (Stratagene) or "pSPORTl" (Gibco BRL). 
In addition, a number of expression vectors may be selected for use in bacterial cells to produce 
cleavabie fusion proteins that can be easily detected and/or purified, including, but not limited to 
'TBLUESCRIPT' (a-galactosidase; Stratagene) or pGEX (glutathione S-transferase; Promega, 
Madison, Wis.). A smtable bacterial promoter is any nucleic acid sequence capable of binding 
10 bacterial KNA polymerase and initiating the downstream (3 ') transcription of the coding sequence of 
the CPP gene into mKNA. A bacterial promoter has a transcription initiation region which is usually 
placed proximal to the S' end of the codmg sequence. This transcription initiation r^on typically 
includes an RNA polymerase binding site and a transcription initiation site. Sequences encoding 
metabolic pathway enzymes provide particularly useful promoter sequences. Examples include 
15 promoter sequences derived from sugar metabolizing enzymes, such as galactose, lactose and 
maltose, and sequences derived from biosynthetic enzymes such as tryptophan. Promoters from 
bacteriophage may also be used and are known in the art. In addition, synthetic promoters and hybrid 
promoters are also usefiil; for exanq)le, the tat promoter is a hybrid of the trp and lac promoter 
sequences. Furdiennore, a bacterial pronM>ter can include naturaUy occurring promote 
20 bacterial origin that have the abiUty to bind bacterial KNA polymerase and initiate transcription. An 
efficient ribosome-binding site is also desirable. The expression vector may also include a signal 
pq)tide sequence that provides for secretion of the CPP in bacteria. The signal sequence typically 
encodes a signal pqptide comprised of hydrophobic amino acids which direct the secretion of the 
protein from the cell, as is well known in the art. The protein is either secreted into the growth media 
25 (gram-positive bacteria) or into the periplasmic space, located between the inner and outer membrane 
of the cell (gram-negative bacteria). The bacterial e^qjression vector may also include a selectable 
marker gene to allow for the selection of bacterial strains that have been transformed. Suitable 
selection genes include drug resistance genes such as ampicillin, chloramphenicol, erythromycin, 
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes, such as 
30 those in the histidine, tryptophan and leucine biosynthetic pathways. Vfhea large quantities of CPPs 
are needed, e.g., for the induction of antibodies, vectors which direct high level e3q)ression of fusion 
proteins tiliat are readily purified may be desirable. Such vectors include, but are not limited to, 
multifunctional E. coll cloning and e3q)ression vectors such as BLUESCRIPT (Stratagene), in which 
the CPP coding sequence may be Ugated into the vector in-firame with sequences for the amino- 
35 . terminal Met and the subsequent 7 residues of beta-galactosidase so that a hybrid protdn is 
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produced; PIN vectors (Van Heeke & Schuster JBiol Chem 264:5503-5509 1989)); PET vectors 
(Novagen, Madiscm Wis.); and Uie like. E;q>ressiQa vectors for bacteria include the various 
coznponeats set forth above, and are well known in the art. Examples include vectors for Bacillus 
subtilis, K coll. Streptococcus cremoris, and Streptococcus lividcms, among others. Bacterial 
expression vectors are transformed into bacterial host cells using techniques well known in the art, 
such as calcium diloride mediated transfection, electroporation, and others. 

Expression in Yeast 

Yeast expression systems are well known in the art, and include escpression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha, 
Kluyveromycesfragilis and K, lactis, Pichia guillermondii and P pastoris, Schizosaccharomyces 
pombe, and Yarrowia lipolytica. Examples of suitable promoters for use in yeast hosts include the 
promoters for 3-phosphoglycerate kinase (Hitzeman et al., J. Biol Chem. 255:2073 (1980)) or other 
glycolytic enzymes (Hess et al. J. Adv, Enzyme Reg. 7:149 (1968); Hofland, Biochemistry 17:4900 
(1978)), such as enolase, glyceraldehyde-3- phosphate dehydrogenase, hexokinase, pyruvate 
decarboxylase, phosphofiructokinase, glucose- 6-phosphate isomerase, 3-phosphoglycerate mutase, 
pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, alpha fector, the 
ADH2IGAPDH promoter, glucokinase alcohol oxidase, and PGH. See, for example, Ausubel, et al., 
1990; Grant et al.. Methods in Enzymology 153:5 16-544, (1987). Other yeast promoters, which are 
inducible have the additional advantage of transcription controlled by growth conditions, include the 
promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative 
enzymes associated wi<h nitrogen metabolism, metallothionein, gIyceraldehyde-3-phosphate 
dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and 
promoters for use in yeast ejqpression are further described in BP 73,657. Yeast selectable markers 
include ADE2. HIS4. LEU2. TRPl. and ALG7, vMda. confers resistance'to tunicamycin; the 
neomycin phosphotransferase gene, whidi confers resistance to G4 18; and the CUPl gene, which 
allows yeast to grow in the presence of copper ions. Yeast expression vectors can be constructed for 
intracellular production or secretion of a CPP from the DNA encoding the CPP of interest. For 
example, a selected signal peptide and the appropriate constitutive or inducible promoter may be 
faiserted into suitable restriction sites in the selected plasmid for direct intracellular e3q}ression of the 
CPP. For secretion of the CPP, DNA encoding the CPP can be cloned into the selected plasmid, 
together widi DNA encoding the promoter, the yeast alpha-&ctor secretory signal/leader sequence, 
and linker sequraoes (as needed), for ejq)ressian of the CPP. Yeast cells, can then be transfonned 
with the expression plasmids described above, and cultured in an appropriate fisrmentation media. 
The protein produced by such transformed yeast can thra b e concentrated by precipitation with 10% 
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trichloroacetic acid and analyzed foUoiTving separation by SDS-PAGE and staining of the gels vn&i 

Coomassie Blue stain. The recombinant CP? can subset 
fermentation medium by techniques known to those of skill in the art. 

5 Expression in Mammalian Systems 

The CPP may be expressed in mammalian cells. Mammalian expression systems are known 
in the art, and include retroviral vector mediated ejqpression systems. Mammalian host cells may be 
transformed with any of a nmnber of diferent viral-based expression systems, such as adenovirus, 
where the coding region can be ligated into an adenovirus transcription/translation complex 
10 consisting of the late promoter and tripartite leader sequence. Insertion in a nonessential El or E3 
region of the viral genome results in a viable virus capable of expression of the polypeptide of 
interest in infected host cells. A preferred e>q)ression vector system is a retroviral vector system such 
as is generally described in PCTAJS97/01019 and PCT/US97/101048. Suitable mammalian 
expression vectors contain a mammaUan promoter which is any DNA sequence capable of binding 
15 mammalian RNA polymerase and initiating the downstream (3 ') transcription of a coding sequence 
for CPP into mRNA. A promoter will have a transcription initiating region, which is usually placed 
proximal to the 5 ' end of the coding sequence, and a TATA box, using a located 25-30 base pairs 
upstream of the transcription initiation site. The TATA box is thought to direct KNA polymerase n 
to begui KNA synthesis at the correct site. A mammalian promoter will also contain an upstream 
20 promoter element (enhancer element), typically located within 100 to 200 base pairs upstream of the 
TATA box. An upstream promoter element determines the rate at which transcription is initiated and 
can act in either orientation. Of particular use as mammalian promoters are the promoters from 
mammalian viral genes, sincethe viral genes are often hisihlye?q>ressed and have a broad h^ 
Exanq)les include promoters obtained firom the genomes of viruses such as polyoma virus, fowlpox 
25 vmis (UK 2,21 1, 504 published Jul. 5,1989), adenovirus (such as Adenovirus 2), bovine papilloma 
virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 
(SV40), from heterologous mammalian promoters, e.g., the actin promoter or an immunoglobulin 
promoter, and from heat-shock promoters, provided such promoters are compatible with the host cell 
systems. Transcription of DNA encoding a CPP by higher eukaryotes may be increased by inserting 
30 an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 
10 to 300 bp, that act on a promoter to increase its transcription. Many enhancer sequences are now 
known from manamalian genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, 
however, one will use an enhancer from a eukaryotic cell virus. Examples include the SV40 
enhancer, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the 
35 repUcation origin, and adenovirus enhancers. The enhancer is preferably located at a site 5' from the 
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promoter. In general, the transcription termination and polyadeiq^lation sequences recognized by 
mannmaHgn cells are regulatory regions located 3' to tbe translation stop codon and thus, together 
with the promoter elements, flank the coding sequence. The 3' terminus of the mature mKNA is 
formed by site-specific post-translational cleavage and polyadenylation. Examples of transcription 

5 terminator and polyadenylation signals include those derived firomSV40. Long term, high-yield 

production of recombinant proteins can be effected in a stable expression system. Expression vectors 
which contain viral origins of replication or endogenous expression elements and a selectable marker 
gene may be used for this purpose. Appropriate vectors containing selectable markers for use in 
TTiflmTnalian cells are readily available commercially and are known to persons skilled in the art. 

10 Examples of such selectable markers include, but are not limited to herpes simplex virus thymidine 
kinase and ^r^emmR phosphoribosyltransferase for use in tk- or hprt-cells, respectively. The methods 
of introducing exogenous nucleic acid into mammalian hosts, as well as other hosts, is well known in 
the art, and will vary with the host cell used. Techniques include dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, electroporation, 

15 viral infection, encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the 
DNA into nuclei. 

CPPs can be purified &om culture supematants of mammalian cells transiently transfected 
or stably transformed by an expression vector carrying a CPP-encoding sequence. Preferably, CPP 
is purified from culture supematants of COS 7 cells transiently transfected by the pcD e2q>ression 
20 vector. Transfection of COS 7 cells with pcD proceeds as fi>llows: One day prior to transfection, 

approximately 10^ COS 7 monkey cells are seeded onto individual 100 mm plates in Dulbecco's 
modified Eagle medium (DME) containing 10% fetal calf serum and 2 mM glutamine. To perform 
the transfection, the medium is aspirated from each plate and replaced with 4 ml of DME containing 
50 mM Tris.HCl pH 7.4, 400 mg/ml DEAE-Dextran and 50 jjg of plasmid DNA. The plates are 

25 incubated for four hours at 37^C, then the DNA-containing medium is removed, and the plates are 
washedtwice with 5 ml of serum-free DME. DME is added back to the plates "v^ch are then 

incubated for an additional 3 hrs at 370C. Hie plates are washed once with DME, after which DME 
containing 4% fetal calf serum, 2 mM glutamine, penicillin (100 U/L) and s trep tom ycin (100 \is/L) 
at standard concentrations is added. The cells are then incubated for 72 hrs at 31^C, after which the 
30 growth medium is collected for purification of CPP . Plasmid DNA for the transfections is obtained 
by growing pcD(SRa), or like expression vector, containing the CPP-encoding cDNA insert in E. 
coU MC1061, described by Casadaban and Cohen, J. Mol. Biol., Vol. 138, pgs. 179-207 (1980), or 
like organism. The plasmid DNA is isolated from the cultures by standard techniques, e.g. 
Sambrook et al.. Molecular Cloning: A Laboratory Manual, Second Edition (Cold Spring Haifoor 
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Laboratory, New York, 1989) or Ausubel et al (1990, cited above). 

Expression in Insect Cells 

CPPs may also be produced in insect cells. Expression vectors for the transformation of 

5 insect cells, and in particular, baculovirus-based expression vectors, are well known in the art. Li one 
such system, the CPP-encoding DNA is fused upstream of an epitope tag contained within a 
baculovirus expression vector. Autographa califomica nuclear polyhedrosis virus (AcNPV) is used . 
as a vector to express foreign genes in Spodoptera frugiperda Sf9 cells or in Trichoplusia larvae. 
The CPP-encoding sequence is cloned into a nonessential region of the virus, such as the polyhediin 

10 gene, and placed under control of the poljiiedrin promoter. Successful insertion of a CPP-encoding 
sequence will render the polyhedriix gme inactive and produce recombinant virus lacking coat protein 
coat. The recombinant viruses are then used to infect 5. frugiperda cells or Trichoplusia larvae in 
which the CPP is e?q)ressed (Smith et al., J. Wol. 46:584 (1994); Engelhard E K et al., Proc. Nat 
Acad. Set 91:3224-3227(1994)). Suitable epitope tags for fusion to the CPP-encoding DNA 

15 include poly-his tags and inmiunoglobulia tags (like Fc regions of IgG). A variety of plasmids may 
be employed, including commercially available plasmids such as pVL1393 (Novagen). Briefly, the 
CPP-encoding DNA or the desired portion of the CPP-encoding DNA is amplified by PCR with 
primers complementary to the 5* and 3* regions. The 5' primer may incorporate flanking restriction 
sites. The PCR product is then digested witli the selected restriction enzymes and subcloned into an 

20 expression vector. Recombinant baculovirus is graierated by co-transfecdng the above plasmid and 
BaculoGoldTM virus DNA (Pharmingen) into Spodoptera frugiperda ("S£9") cells (ATCC CRL 
1711) using lipofectin (commercially available firom GIBCO-BRL), or other methods known to those 
of skill in the art. Virus is produced by day 4-5 of culture in Sfi? cells at 28*'C, and used for further 
amplifications. Procedures are pCTformed as fimher described in O'Reilley et al., BACULOVIRUS 

25 EXPRESSION VECTORS: A LABORATORY MANUAL, Oxford University Press (1994). Extracts 
may be prepared from recombinant virus-infected Sf9 cells as described in Rupert et al.. Nature 
362:175-179 (1993), Alternatively, e3q)ressed epitope-tagged CPP can be purified by afiBnity 
chromatography, or for example, purification of an IgG tagged (or Fc tagged) CPP can be performed 
using chromatography techniques, including Protein A or protein G colunm chromatography. 

30 

Evaluation of Gene Expression 

Gene expression may be evaluated in a sample directly, for example, by standard techniques . 
known to those of skill in the art, e.g.. Northern blotting to determine the transcription of mRNA, dot 
blotting (DNA or RNA), or in situ hybridization, using an appropriately labeled probe, based on the . 
35 sequences provided herdiL Alternatively, antibodies may be used in assays for detection of 
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polypeptides, nucleic acids, such as specific duplexes, including DNA duplexes, RNA duplexes, and 

DNA-RNA hybrid duplexes or DNA-piotein duplexes. Such antibodies may be labeled and the assay 

carried out where the duplex is bound to a sur&ce, so that upon the formation of dupl% on the 

sur£Eu:e, the presence of antibody bound to the dupl^ can be detected. Gene expression, 

alternatively, may be measured by immunohistocheniical staining of cells or tissue sections and assay 

of cell culture or body fluids, to directly evaluate the expression of a CPP polypeptide or 

polynucleotide. Antibodies useful for such immunological assays may be either monoclonal or 

polyclonal, and may be prepared against a native sequence CPP. Protein levels may also be detected 

by mass spectrometry. A further method of protein detection is with protein chips. 

Purification of Expressed Protein 

Expressed CPP may be purified or isolated after ejqpression, using any of a variety of 
methods known to those skilled in the art. The appropriate technique will vary depending upon what 
other components are present in the sample. Contaminant components that are removed by isolation 
or pmification are materials that would typically interfere with diagnostic or therapeutic uses for the 
polypeptide, and may include enzymes, hormones, and other solutes. The purification step(s) selected 
will depend, for example, on the nature of the production process used and the particular CPP 
produced. As CPPs are secreted, they may be recovered from culture medium. Alternatively, the 
CPP may be recovered from host cell lysates. If membrane-bound, it can be released from the 
membrane using a suitable detergent solution (e.g. Triton-X 100) or by enzymatic cleavage. 
Alternatively, cells employed in expression of CPP can be disrupted by various physical or chemical 
means, such as freeze-thaw cycling, sonication, mechanical disruption, or by use of cell lysing 
agents. Exemplary purification methods include, but axe not limited to, ion-exchange colunon 
chromatography; chromatography using silica gel or a cation-exchange resin such as DEAE; gel 
filtration using, for example, Sqphadex G-75; protein A Sepharose columns to remove conta m ina n ts 
such as IgG; chromatography using metal chelating columns to bind epitope-tagged forms of the 
CPP; ethanol precipitation; reverse phase HPLC; chromatofocusing; SDS-PAGE; and aihmonium 
sulfete precipitation. Ordinarily, an isolated CPP will be prepared by at least one purification step. 
For example, the CPP may be purified using a standard anti-CPP antibody column. Ultrafiltration 
and dialysis techniques, in conjunction with protein concentration, are also useful (see, for example. 
Scopes, R., PROTEIN PURIFICATION, Springer-Verlag, New York, N.Y., 1982). The degree of 
purification necessary will vary dependiag on the use of the CPP. Li some instances no purification 
will be necessary. Once expressed and purified as needed, the CPPs and nucleic acids of the present 
invention are useful in a number of appUcations, as detailed herein. 
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Transgenic animals 

The host cells of the inventioa can also be used to produce nonhuinan transgenic animals. 
For example, in one embodiment, a host cell of the invention is a fertilized oocyte or an embryonic 
stem cell into which CPP-coding sequences have been introduced. Such host cells can then be used to 

S create non-human transgenic animals in which exogenous CPP sequences have been introduced into 
their genome or homologous recombinant animals in vMch endogenous CPP sequences have been 
altered. Such animals are useful for studying the function and/or activity of a CPP or fragm ent 
thereof and for identifying and/or evaluating modulators of CPP biological activity. As used herein, a 
'^transgenic animal" is a non-human animal, preferably a mammal, more preferably a rodent such as 

10 a rat or mouse, in which one or more of the cells of the animal include a trausgene. Other examples 
of transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens, amphibians, 
etc. A transgene is exogenous DNA which is integrated into the genome of a cell from which a 
transgenic animal develops and which remains in the genome of the mature animal, thereby directing 
the e7q)ression of an encoded gene product in one or more ceD types or tissues of the transg^c 

15 animal. As used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous gene has been altered by homologous 
recombination between the endogenous gene and an exogenous DNA molecule introduced into a cell 
of the animal, e.g., an embryonic cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing a CPP-encoding nucleic 

20 add into the male pronuclei of a fertilized oocyte, e.g., by microinjection or retroviral infection, and 
allowing the oocyte to develop in a pseudopregnant female foster animal. The CPP cDNA sequence 
or a fragment thereof can be introduced as a transgene into the genome of a non-hmnan animal. 
Altematively, a nonhuman homologue of a human CPP-encoding gene, such as fix>m mouse or rat, 
can be used as a transgene. hxtronic sequences and polyadenylation signals can also be included in 

25 the transgene to increase the eflSciency of expression of the transgene. A tissue-specific regulatory 
sequCTce(s) can be operably linked to a CPP transgene to direct expression of a CPP to particular 
cells. Methods for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become conventional in the art and are described, for 
example, in U.S. Pat. Nos. 4,736,866 and 4,870,009, both by Leder et al., U.S. Pat. No. 4,873,191 
• 30 by Wagner et al. and in Hogan, B., Manipulating the Mouse Embryo, (Cold Spring Harbor 

Laboratory Press, Cold Spring Harbor, N.Y., 1986, the disclosure of which is incorporated herein by 
reference in its entirety). Similar methods are used for production of other transgenic animals. A 
transgenic founder animal can be identified based upon the presence of a CPP transgene in its 
genome and/or expression of CPP mKNA in tissues or cells of the animals. A transgenic feunder 

35 animal can then be used to breed additional animals carrying the transgene. Moreover, transgenic 
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animals carrying a transgene encoding a CPP can further be bred to other transgenic animals 
carrying other transgenes. 

To create an animal in which a desired nucleic acid has been introduced into the genome via 
homologous recombination, a vector is prepared which contains at least a portion of a CPP-encoding 
5 sequence into vMch a deletion, addition or substitution has been introduced to thereby alter, e.g., 
functionally disrupt, the CPP-encoding sequrace. The CPP-encoding sequence can be a human gene, 
but more preferably, is a non-human homologue of a human CPP-encoding sequence (e.g., a cDNA 
isolated by stringent hybridization with a nucleotide sequence coding for a CPP). For example, a 
mouse CPP-encoding seqpience can be used to construct a homologous recombination vector suitable 
10 for altering an endogenous gene in the mouse genome. In a preferred embodiment, the vector is 
designed such tiiat, upon homologous recombination, the endogenous CPP-encodmg sequence is 
fonctionally disrupted (i.e., no longer encodes a functional protein; also referred to as a **knock out" 
vector). Alternatively, the vector can be designed such that, upon homologous recombination, the 
endogenous CPP-encoding sequence is mutated or otherwise altered but still encodes functional 
15 protein (e.g., the upstream regulatory region can be altered to thereby alter the e3q>ression of the 

endogenous CPP-encoding sequence). In the homologous recombination vector, the altered portion of 
the CPP-encoding sequence is flanked at its 5' and 3' ends by additional nucleic acid sequence of the 
CPP gene to allow for homologous recombination to occur between the exogenous sequence carried 
by the vector and an endogenous gene in an embryonic stem cell. The additional flanking nucleic acid 
20 sequence is of sufSdent length for successful homologous recombination with the endogenous gene. 
Typically, several kilobases of flanking DNA (both at the 5' and 3* ends) are included in the vector 
(see e.g., Thomas, K. R. and Capecchi, M. R. (1987) Cell 51:503, the disclosure of which is 
incorporated herein by reference in its entirety, for a description of homologous recombination 
vectors). The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and cells 
25 in which the introduced CPP-encoding sequence has homologously recombined with the endogenous 
gene are selected (see e.g., Li, E. et al. (1992) Cell 69:915, the disclosure of which is incorporated 
herein by reference in its entirety). The selected cells are then injected into a blastocyst of an animal 
(e.g., a mouse) to form aggregation chimeras (see e.g., Bradley, A. in Teratocarcinomas and 
Embryonic Stem Cells. A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987) pp. 1 13- 
30 152, the disclosure of lAdiich is incorporated herein by reference in its entirety). A chimeric embryo 
can then be implanted into a suitable pseudoprpgnant female foster animal and the embryo brought to 
term. Progeny harboring the homologously recombined DNA in tiieir germ cells can be used to breed 
animals in v(iiich all cells of tiie animal contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination vectors and 
35 homologous recombinant animals are described further in Bradley, A. (1991) Current Opinion in 
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Biotechnology 2:823-829 and in PCT latemational Publication Nos.: WO 90/1 1354 by Le Mouellec 
et al.; WO 91/01 140 by Smithies et al.; WO 92/0968 by Zijlstra et al.; and WO 93/04169 by Bems 
et al., the disclosures of whidi are incoiporated herein by reference in their entireties. 

In another embodiment, transgenic non-hunian animals can be produced which contain 
5 selected systems which allow for regulated expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of bacteriophage PI. For a description of the cre/loxP 
recombinase system, see, e.g., Lakso et al. (1992) PNAS 89:6232-6236, the disclosure of which is 
incorporated herein by reference in its entirety. Another example of a recombinase system is the FLP 
recombinase system of Saccharomyces cerevisiae (O'Gonnan et al. (1991) Science 251:1351-1355, 
10 the disclosure of which is incorporated herein by reference in its entirety). If a cr^oxP recombinase . 
system is used to regulate expression of the transgene, animals containing transgenes encoding both 
the Cre recombinase and a selected protein are required* Such animals can be provided through the 
construction of "double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and tiie other containing a transgaoe encoding a lecQmbioase. * " 

15 

Assessing CPP activity 

It will be appreciated that the invention further provides methods of testing the activity of or 
obtaining functional fiagments and variants of CPPs and CPP sequences. Such methods involve 
providing a variant or modified CPP-encoding nucleic acid and assessing whether the encoded 

20 polypeptide displays a CPP biological activity. Encompassed is thus a method of assessing the 

function of a CPP comprising: (a) providing a CPP, or a biologically active i&agment or homologue . 
thereof; and (b) testing said CPP, or a biologically active firagment or homologue thereof for a CPP 
biological activity. Any suitable format may be used, including cell fiiee, cell-based and in vivo 
formats. For example, said assay may comprise e3q)ressing a CPP nucleic acid in a host cell, and 

25 observing CPP activity in said cell and other afifected cells. In another example, a CPP, or a 
biologically active fiagment or homologue thereof is contacted with a cell, and a CPP biological 
activity is observed, 

A CPP biological activity may be any activity as described herein, such as (1) protection 
fi-om cardiovascular disease; (2) circulating througji the bloodstream; (3) antigemcity, or the abihty 
30 to bind an anti-CPP specific antibody; (4) immunogenicity, or the ability to gienerate an anti^CPP 
specific antibody; or (5) interacting with a CPP target molecule, preferably a protein. 

CPP biological activity can be assayed by any suitable method known in the art, 
Antigemcity and immunogenidty may be detected, for example, as described in the sections titled 
"Anti CPP antibodies" and *Uses of CPP antibodies". Circulation in blood plasma may be detected 
35 as described in **Diagnostic and Prognostic Uses". Itxteraction with a CPP target molecule may be 
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detected according to any of fhe methods described herein, for example, in ^e section titled 'Drug 
Screening Assays". 

Cardiovascular disorders may be diagnosed by any method determined appropriate for aa 
individual by one of skill in the art. Further examples of symptoms and diagnostics may be fomd in 

5 the Backgroimd section, and are best determined appropriately by one of skill in the art based on the 
particular profile of a patient. 

Intramolecular interactions may be detected by sequence-based structural predictions. Such 
predictions are generally based on X-ray crystallography or NMR structural data for a polypeptide 
with similar sequence. Detection of intramolecular interactions may also be accomplished using 

10 SDS-PAGE. For the example of disulfide bonds, links formed between diGfermt portions of a given 
protein result in a more compacted protein, and thus, a reduced apparent molecular weight. 
Disulfide bonds may be disrupted by a reducing agent, for example, dithioOtreitol (DTT). A protein 
sanq}le that has been treated with a reducing agent may thus be compared to an untreated control by 
SDS-PAGE to detect a change in apparent molecular weight. Such methods are common to the art. 

15 Amidation may be detected by conq)aring the molecular weigjht of a sample peptide to lhat of 

an amidated form of the same peptide. The amidated form may be prepared according to conmion 
methods, for example, as disclosed in US Patent 4708934. Molecular weights are easily compared 
according to any method common to the art such as SDS-PAGE, gel chromatography, or mass 
spectrometry. Proteolysis may also be detected by conq>aring the molecular weight of a sanq)le 

20 peptide to that of a peptide of known molecular weight. Preferably, the molecular weight of a test 
pq)tide is obtained by mass spectrometry and compared to a database comprisiog molecular weights 
of peptides with posttranslational modifications. Exemplary databases include Genpept, 
SWISSPROT, EMBL, and the Protein Sequence Database. Such techniques are detailed fiirther 
herein. 

25 

Anti-CPP Antibodies 

The present invention provides antibodies and binding compositions specific for CPPs. Such 
antibodies and binding compositions include polyclonal antibodies, monoclonal antibodies. Fab and 
single chain Fv fragments thereof, bispedfic antibodies, heteroconjugates, hmnanized antibodies, and 
30 the like. Such antibodies and binding conq)ositions may be produced in a variety of ways, including 
hybridoma cultures, recombinant e7q)ression in bacteria or mammalian cell cultures, recombinant 
esqpression in transgenic animals, and the like. There is abundant guidance in the literature for 
selecting a particular production methodology, e.g. Chadd and Chamow, Curr. Opin. Biotechnol., 
12: 188-194 (2001). 

35 The choice of manu&cturing methodology depends on several factors including the antibody 
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Structure desired, the importance of carbohydrate moieties on the antibodies, ease of culturing and 
purification, cost, and the like. Many different antibody structures may be generated using standard 
e?q)ression technology, including full-length antibodies, antibody fragments, such as Fab and Fv 
fragments, as well as chimeric antibodies comprising components firom different species. Antibody 

5 fragments of small size, such as Fab and Fv fragments, having no effector functions and limited 

pharmokinetic activity may be generated in a bacterial expression system. Single chain Fv fragments 
are highly selective for in vivo tumors, show good tumor penetration and low immunogenicity, and 
are cleared rapidly from the blood, e.g. Freyre et al, J. BiotechnoL, 76: 157-163 (2000). Thus, such 
molecules are desirable for radioimmunodetection and in situ radiotherapy. Whenever 

10 pharmacokinetic activity in the form of increased half-life is required for therapeutic purposes, then 
fuU-lengtfa antibodies are preferable. For example, the immunoglobulin G (IgG) molecule may be 
one of four subclasses: yl, 72, y3, or y4. If a fuU-leagth antibody with effector function is required, 
then IgG subclasses yl or y3 are preferred, and IgG subclass yl is most preferred. The y 1 and y3 
subclasses exhibit potent effector function, complement activation, and promote antibody-dependent 

15 cell-mediated cytotoxicity throu^ interaction with specific Fc receptors, e.g. Raju et al, 
Glycobiology, 10: 477-486 (2000); Lund et al, J. Immunol., 147: 2657-2662 (1991). 



Polyclonal Antibodies 

The anti-CPP antibodies of the present invention may be polyclonal antibodies. Such 

20 polyclonal antibodies can be produced in a mammal, for example, following one or more injections 
of an immxmizing agent, and preferably, an adjuvant. Typically, the immunizing agent and/or 
adjuvant will be injected into the mammal by a series of subcutaneous or intraperitoneal injections. 
The I'tnmiiniTi'ng agent may include CPPs or a fusion protein thereof It may be usefiil to conjugate 
the antigen to a protein known to be immunogenic in the mammal being immunized. Examples of 

25 such immunogenic proteins include, but are not limited to, keyhole limpet hemocyanin (KLH), 
methylated bovine serum albumin (mBSA), bovine serum albumin (BSA), Hepatitis B surfece 
antigen, serum albumin, bovine IhyroglobuUn, and soybean trypsin inhibitor. Adjuvants include, for 
example, Freund's oranplete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic 
trehalose dicoryno-mycolate). The immunization protocol may be determined by one skilled in the art 

30 based on standard protocols or by routine experimentation. 

Alternatively, a crude protein preparation which has been enriched fi)r a CPP or a portion 
thereof can be used to generate antibodies. Such proteins, firagmeixts or preparations are introduced 
into the non-human mammal in the presence of an appropriate adjuvant. If the serum contains 
polyclonal antibodies to undesired ^itqpes, the polyclonal antibodies are purified by iimnunoafBnity 

35 duomatography. 
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Efifective polyclonal antibody production is affected by many &ctors related both to the 
antigen and the host species. Also, host animals vary in response to site of inoculations and dose, 
with both inadequate and excessive doses of antigen resulting in low titer antisera. Small doses (ng 
level) of antigen administered at multiple intradermal sites appear to be most reliable. Techniques for 

5 producing and processing polyclonal antisera are known in the art, see for example, Mayer and 
Walker (1987), the disclosure of which is incorporated herein by reference in its entirety. An 
effective immimization protocol for rabbits can be found in Vaitukaitis, J. et al, J. Clin. Endocrinol. 
Metab. 33:988-991(1971), the disclosure of which is incorporated by reference in its entirety. 
Booster injections can be given at regular intervals, and antiserum harvested when antibody titer 

10 thereof as determined semi-quantitatively, for example, by double immunodifiEiision in . agar against 
known concentrations of the antigen, begins to fall. See, for example, Ouchterlony, O. et al.. Chap. 
19 in: H^dbook of Experimental Immunology D, Wier (ed) Blackwell (1973). jPlateau concentration 
of antibody is usually in the range of 0.1 to 0.2 mg/ml of senun (about 12: M). Affinity of the 
antisera for the antigen is determined by preparing competitive binding curves, as described, for 

15 exaaq>le, by Fisher, D., Ch^. 42 in: Manual of Clinical Lnraunology, 2d Ed. (Rose and Friedman, 
Eds.) Amer. Soc. For Microbiol., Washington, D. C. (1980). 

Monoclonal Antibodies 

Alternatively, the anti-CPP antibodies may be monoclonal antibodies. Monoclonal antibodies 

20 may be produced by hybridomas, wherein a mouse, hamster, or other appropriate host a nimal, is 

inmiunized with an imm^initnng agent to elicit lynqphocj^ that produce or are capable of producing 
antibodies that will specifically bind to the hnmunizing agent, e.g. Kohler and Milstein, Nature 
256:495 (1975). The immunizing agent will typically include the CPP or a fusion protem thereof and 
optionally a carrier. Alternatively, the lymphocytes may be immunized in vitro. Generally, spleen 

25 cells or lymph node cells are used if non-human mammalian sources are desired, or peripheral blood 
lymphocytes (*TBLs") are used if cells of human origin are desired. The lymphocytes are fused with 
an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to produce a 
hybridoma cell, e.g. Goding, MONOCLONAL ANTIBODIES: PRINCIPLES AND PRACTICE, 
Academic Press, pp. 59-103 (1986); Liddell and Cryer, A Practical Guide to Monoclonal Antibodies 

30 (John Wiley & Sons, New York, 199 1); Malik and Lillenoj, Editors, Antibody Techniques 

(Academic Press, New York, 1994). In general, immortalized cell lines are transformed mammalian 
cells, for example, nayeloma cells of rat, mouse, bovine or human origin. The hybridoma cells are 
cultured in a suitable culture medium Ibat preferably contains one or more substances that inhibit the ' 
growth or survival of imfiised, immortalized cells. For example, if the parratal cells lack the enzyme 

35 hypoxandiine guanine phosphoribosyl transferase (HGPRT), the culture medium for the hybridomas 
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typically will include hypoxanthine, aminopterm, and thymidine (HAT), substances which prevent 
the growth of HGPRT-defideot cells. Preferred immortalized cell lines are Aose that fuse efBdenfly, 
support stable high level production of antibody, and are sensitive to a medium such as HAT 
medium. More preferred immortalized cell lines are murine or h\mian myeloma lines, which can be 
5 obtained, for example, from the American Type Culture Collection (ATCC), Rockville, MD. Human 
myeloma and mouse-human heteromyeloma cell lines also have been described for the production of 
human monoclonal antibodies, e.g. Kozbor, J. Immunol. 133:3001 (1984); Brodew et al.. 
Monoclonal Antibody Production Techniques and AppUcations, Marcel Dekker, Inc., New York, pp. 
51-63 (1987). 

10 The culture medium (supernatant) in which the hybridoma cells are cultured can be assayed 

for the presence of monoclonal antibodies directed against a CPP. Preferably, the binding specificity 
of monoclonal antibodies present in the hybridoma supernatant is determined by immunoprecipitation 
or by an in vitro binding assay, such as radio- immunoassay (RIA) or Enzyme-Linked Immune 
Sorbent Assay (ELISA). Appropriate techniques and assays are known in the art. The binding 

15 afiBnity of the monoclonal antibody can, for example, be determined by the Scatchard analysis of 
Munson and Pollard, Anal Biochem, 107:220 (1980). After the desired antibody-producing 
hybridoma cells are identified, the cells may be cloned by limiting dilution procedures and grown by 
standard methods (Coding, 1986, supra). Suitable culture media for this purpose include, for 
exan5)le, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the 

20 hybridoma cells may be grown in vivo as ascites in a mammal. The monoclonal antibodies secreted 
by selected clones may be isolated or purified fixim the culture medium or ascites fluid by 
immunoglobulin purification procedures routinely used by those of skill in the art such as, for 
example, protein A-Sepharose, hydroxyl-apatite chromatography, gel electrophoresis, dialysis, or 
« afBnity chromatography. 

25 The monoclonal antibodies may also be made by recombinant DNA methods, such as those 

described in U.S. Pat. No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can 
be isolated from the CPP-specific hybridoma cells and sequenced, e.g., by using oUgonucleotide 
probes that are capable of binding specifically to genes encoding the heavy and Ught chains of 
murine antibodies. Once isolated, the DNA may be inserted into an expression vector, which is then 

30 transfected into host cells sudi as simian COS cells, Chinese hamster ovary (CHO) cells, or 

myeloma cells that do not otiierwise produce immunoglobulin protein, to obtain the synthesis of 
monoclonal antibodies in the recombinant host cells. The DNA also may be niodified, for example, 
by substituting the coding sequmce for the murine heavy and light chain constant domains for the 
homologous human sequences (Morrison et al., Proc, Nat AcacL Set 81:6851-6855 (1984); 

35 Neuberger et al„ Nature 3 12:604-608 (1984); Takeda et al.. Nature 3 14:452-454 (1985)), or by 

50 



wo 2004/061096 PCT/£P2004/000027 
covalenfly joiiiisg to tbe iixummoglobiilin codmg sequence all or part of the coding sequence for a 
* non-mununoglobulin polypeptide. The non-hnniunoglobulm polypeptide can be substituted for the 
constant domains of an antibody of tiie invention, or can be substituted for the variable domains of 
one antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody. 
5 The antibodies may also be monovalCTt antibodies. Methods for preparing monovalent antibodies are 
well known in the art. For example, in vitro methods are suitable for preparing monovalent 
antibodies. Digestion of antibodies to produce fragments thereof, particularly. Fab fragments, can be 
accomplished using routine techniques known in the art. 

Antibodies and antibody fragments characteristic of hybridoncias of the invention can also be 
10 produced by recombinant means by extracting messenger RNA, constructing a cDNA library, and 
selecting clones which encode segments of the antibody molecule. The following are exemplary 
■ references disclosing recombiiianttedmiques for producing antibodies: Wall et al.. Nucleic Acids 
Research, Vol. 5, pgs. 3113-3128 (1978); Zakut et al.. Nucleic Acids Research, Vol. 8, pgs: 3591- 
3601 (1980); Cabilly et al., Proc. Natl. Acad. Sci., Vol. 81, pgs. 3273-3277 (1984); Boss et al., 
15 Nucleic Acids Research, Vol. 12, pgs. 3791-3806 (1984); Amster et al., Nucldc Acids Research, 
Vol. 8, pgs. 2055-2065 (1980); Moore et al., U.S. Patent 4,642,334; Skerra et al. Science, Vol. 240, 
pgs. 1038-1041(1988); Huse et al. Science, Vol. 246, pgs. 1275-1281 (1989); and U.S. patents 
6,054,297; 5,530,101; 4,816,567; 5,750,105; and 5,648,237; which patents are incorporated by 
reference. * In particular, sudi techniques can be used to produce interspecific monoclonal antibodies, 
20 wherein the binding region of one species is combined with non-binding region of the antibody of 

another species to reduce inamunogenicity, e.g. Liu et al., Proc. Natl. Acad. Sci., Vol. 84, pgs. 3439- 
3443 (1987), and patents 6,054,297 and 5,530,101. Preferably, recombinantly produced Fab and 
Fv fragments are expressed in bacterial host systems. Preferably, ftill-length antibodies are produced 
by mammalian cell culture techniques. More preferably, fiill-length antibodies are e3q)ressed in * 
25 Chinese Hamster Ovary (CHO) cells or NSO cells. 

Both polyclonal and monoclonal antibodies can be screened by ELISA. As in other solid 
phase immunoassays, the test is based on the tendency of macromolecules to adsorb nonspecifically 
to plastic. The irreversibility of this reaction, without loss of inamunological activity, allows the 
fonxiation of antigen-antibody complexes with a simple separation of such complexes from unbound 
30 material. To titrate antipeptide serum, peptide conjugated to a carrier different from that used in 
immunization is adsorbed to the wells of a 96-well microtiter plate. The adsorbed antigen is then 
allowed to leact in the wells with dilutions of anti-pqptide serum. Unbovmd antibody is wa^ed 
away, and the remaining antigen-antibody complexes are allowed to react with an antibody specific 
for the ]^ of the immunized animal. This second antibody is conjugated to an enzyme such as 
35 ?^llfa1iTift phosphatase. A visible colored reaction produced when the ^izyme substrate is added 
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indicates which wells have bound aatipeptide antibodies. The use of spectrophotometer readings 
allows better quantification of the amount of peptide-specific antibody bound. High-titer antisera 
yield a linear titration curve between lO'^ and 10"^ dilutions. 

5 CPP peptide carriers 

The invention includes immunogens derived firom CPPs, and inmunogens comprising 
conjugates between carriers and peptides of the invention. The term immimogen as used herein 
refers to a substance which is capable of causing an iimnune response. The term carrier as used 
herein refers to any substance which when chemically conjugated to a peptide of the invention 
10 permits a host organism immunized with the resulting conjugate to generate antibodies specific for 
the conjugated peptide. Carriers include red blood cells, bacteriophages, proteins, or synthetic 
particles such as agarose beads. Preferably, carriers are proteins, such as serum albumin, gamma- 
globulin, keyhole limpet hemocyanin, thyroglobulin, ovalbumin, fibrinogen, or the like. 

The general technique of linking syntheti6 peptides to a carrier is described in several 
15 references, e.g. Walter and Doolittle, "Antibodies Against Synthetic Peptides," in Setlow et al., eds.. 
Genetic Engineering, Vol. 5, pgs. 61-91 (Plenum Press, N.Y., 1983); Green et al. Cell, Vol. 28, pgs. 
477-487 (1982); Lemer et al., Proc. Natl. Acad. Sd., Vol 78, pgs. 3403-3407 (1981); Shimizu et 
al., U.S. Patent 4,474,754; and Ganfield et al., U.S. Patent 4,311,639. Accordingly, these references 
are incorporated by reference. Also, techniques employed to link haptens to carriers are essentially 
20 the same as the above-referenced techniques, e.g. chapter 20 in Tijssen, Practice and Theory of 
Enzyme Immunoassays (Elsevier, New York, 1985). The four most commonly used schemes for 
attaching a peptide to a carrier are (1) glutaraldehyde for amino coupling, e.g. as disclosed by 
Kagan and Glick, in Jaffe and Behrman, eds. Methods of Hormone Radioimmunoassay, pgs. 328- 
329 (Academic Press, N.Y., 1979), and Walter et al. Proc. Natl. Acad. Sci., Vol. 77, pgs. 5197- 
25 5200 (1980); (2) water-soluble carbodiimides for carboxyl to amino coupling, e.g. as disclosed by 
Hoare et al., J. Biol. Chem., Vol. 242, pgs. 2447-2453 (1967); (3) bis-diazobenzidine (BDB) for 
tyrosine to tyrosine sidechain coupling, e.g. as disclosed by Bassiri et al., pgs. 46-47, in Jaffe and 
Behrman, eds. (cited above), and Walter et al, (cited above); and (4) maleimidobenzoyl-N- 
hydroxysucdnimide ester (MBS) for coupling cysteine (or other sulfhydryls) to amino groups, e.g. as 
30 disclosed by Kitagawa et al., J. Biochem, (Tokyo), Vol. 79, pgs. 233-239 (1976), and Lemer et al. 
(dted above). A general rule for selecting an appropriate method for coupling a given pqitide'to a 
piotein carrier can be stated as follows: the group involved in attachment should occur only once in 
the sequence, preferably at the appropriate end of the spgment. For example, BDB should not be 
used if a tyrosine residue occurs in the main part of a sequence chosen for its potentially antigenic 
35 character. Sinoilarly, centraUy located lysines rule out the glutaralddiydem^ 
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occurrences of aspartic and glutamic acids frequently exclude the carbodiinude approach. On the 
other hand, suitable residues can be positioned at either end of chosen sequence segment as 
attachment sites, \!s^ether or not they occur in the "native" protein sequence. Internal segments, 
unlike the amino and carboxy termini, will differ significantly at the "unattached end" from the same 
5 sequence as it is found in the native protein where the polypeptide backbone is continuous. The 
problem can be remedied, to a degree, by acetylating the a-amino group and then attaching the 
peptide by way of its carboxy terminus. The coupling efficiency to the carrier protein is 
conveniently measured by using a radioactively labeled peptide, prepared either by using a 
radioactive amino acid for one step of the synthesis or by labeling the conq)leted peptide by the 
10 iodination of a tyrosine residue. The presence of tyrosme in the peptide also allows one to set up a 
sensitive radioiitmnme assay, if desirable. Therefore, tyrosine can be introduced as a terminal 
residue if it is not part of the peptide sequrace defined by the native polypeptide. 

Preferred carriers are proteios, and preferred protein carriers include bovine serum albumin, 
nxyoglobulin, ovalbumin (OVA), kejiiole linq)et hemocyanin (KUHQ, or the Uke. Peptides can be 
15 linked to KLH through cysteines by MBS as disclosed by Liu et al.. Biochemistry, Vol. 1 8, pgs. 
690-697 (1979). The peptides are dissolved in phosphate-buffered saline (pH 7.5), 0.1 M sodium 
borate buffer (pH 9.0) or 1 .0 M sodium acetate buffer 4.0). The pH for the dissolution of the 
peptide is chosen to optimize pqrtide solubility. The content of free cysteine for soluble peptides is 
determined by EUman's method, EUman, Arch. Biochem. Biophys., Vol. 82, pg. 7077 (1959). For 
20 each peptide, 4 mg KLH in 0.25 ml of 10 mM sodium phosphate buffer (pH 7.2) is reacted with 0.7 
mg MBS (dissolved in dimethyl formamide) and stirred for 30 min at room temperature. The MBS 
is added dropwise to ensure that the local concentration of formamide is not too higji, as KLH is 
insoluble in >30%fomiamide. The reaction product, KLH-MBS, is then passed through Sephadex • 
G-25 equilibrated with 50 mM sodium phosphate buffer (pH 6.0) to remove free MBS, KLH 
25 recovery from peak fractions of the column eluate (monitored by OD280) is estinoated to be 

approximately 80%. KLH-MBS is then reacted with 5 mg peptide dissolved in 1 ml of the chosen 
buffer. The pH is adjusted to 7-7.5 and the reaction is stirred for 3 hr at room temperature. 
Coupling efQciency is monitored with radioactive peptide by dialysis of a sample of the conjugate 
against phosphate-bufifered saline, and may range from 8% to 60%. Once the peptide-carrier 
30 coxyugate is available, polyclonal or monoclonal antibodies are produced by standard techniques, e.g. 
as disclosed by Campbell, Monoclonal Antibocfy Technology (Elsevier, New York, 1984); Hurrell, 
ed. Monoclonal Hybridoma Antibodies: Techniques and Applications (CRC Press, Boca Raton, FL, 
1982); Schreier et al. Hybridoma Techniques (Cold Spring Harbor Laboratory, New York, 1980); 
U.S. Patmt 4,562,003; or the like, la particular, U.S. Patent 4,562,003 is incorporated by reference. 
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Humcmized Antibodies 

The aati-CPP antibodies of the inveation may fiirther comprise humanized antibodies or 
human antibodies. The term 'Immanized antibody^' refers to humanized forms of non-human (e.g., 
murine) antibodies that are chimeric antibodies, immimoglobxilin chains or fragments thereof (such 
5 as Fv, Fab, Fab', F(ab'), or other antigen-binding partial sequences of antibodies) which contain 
some portion of the sequence derived from non-human antibody. Humanized antibodies include - 
human immunoglobulins in which residues from a complementary determining region (CDR) of the 
human immunoglobulin are replaced by residues from a CDR of a non-human species such as 
mouse, rat or rabbit having the desired binding specificity, afBnity and capacity. In general, the 
10 humanized antibody will comprise substantially all of at least one, and generally two, variable 

domains, in which all or substantially all of the CDR regions correspond to those of a non-huinan 
immunoglobulin and all or substantially all of the PR regions are those of a human immimoglobulin 
consensus sequence. The humanized antibody optimally also will comprise at least a portion of an 
immunoglobulin constant r^on (Fc), typically that of a human immunoglobulin (Jones et al., 
15 Nature 321:522-525 (1986) and Presta, Curr. Op. Struct Biol 2:593-596 (1992)). Methods fcr 
yy xftttm^-Txn^ uon-human antibodies are well known in the art. Generally, a humanized antibody has 
one or more amino acids introduced into it from a source which is non-human in oider to more 
closely resemble a human antibody, y*ile still retaining the original binding activity of the antibody. 
Methods for humanization of antibodies are further detailed in Jones et al,. Nature 321 :522-525 
20 (1986); Riechmann et al.. Nature 332:323-327 (1988); and Verhoeyen et al.. Science 239:1534- 

1536 (1988). Such 'liumanized" antibodies are chimeric antibodies in that substantially less than an 
intact human variable domain has been substituted by the corresponding sequence from a non-human 
species. 

25 Heteroconfugate Antibodies 

Heteroconjugate antibodies which comprise two covalently joined antibodies, are also within 
the scope of the present invention. Heteroconjugate antibodies may be prepared in vitro using known 
methods in synthetic protein chemistry, including those involving crosslinking agents. For example, 
immunotoxins may be prepared using a disulfide exchange reaction or by forming a thioether bond. 

30 

Bispeciftc Antibodies 

Bispecific antibodies have binding specificities for at least two different antigens. Such 
antibodies are monoclonal, and preferably human or humanized. One of the blading specificities of a 
bispecific antibody of the present invention is for a CPP, and the other one is preferably for a cell- 
35 sur&ce protein or receptor or receptor subunit. Methods for making bispecific antibodies are known 
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in fhe art, and in general, the recombinant production of bispecific antibodies is based on the co- 
expression of two inununoglobnlin heavy-chain/lights:hain pairs in hybridoma cells, \sdiere the two 
heavy chains have different specificities, e.g. Milsteia and Cuello, Nature 305:537-539 (1983). 
Given lliat the random assortment of inmiunoglobulin heavy and light chains results in production of -. 
5 potentially ten different antibody molecules by the hybridomas, purification of the correct molecule 
usually requires some sort of afBnity purification, e.g. afiBnity chromatography. 

Uses ofCPP antibodies 

CPP antibodies mscy be used as functioiial antagonists. Preferably, such antibodies are 

10 specific for CPPs. More preferably, the antagonists of the invention comprise fi:agments or binding 
compositions specific for CPPs. As used herein, the term "heavy chain variable region" means a 
polypeptide (1) which is firom 1 10 to 125 amino acids in length, and (2) whose amino acid sequence 
corresponds to that of a heavy chain of an antibody of the invention, starting firom the heavy chain's 
N-terminal amino acid. Likewise, the term "light chain variable region" means a polypeptide (1) 

15 which is from 95 to 115 amino adds in length, and (2) whose amino acid sequence conesponds to 
that of a lig^ chain of an antibody of the invention, starting fixm the ligfht dbain's N-terminal amino 
acid. As used herein the term "monoclonal antibody" refers to homogeneous populations of 
immunoglobulins v^toch are capable ofqiedficaHy binding to CP Preferably, antagonists of the 
invention are derived from monoclonal antibodies specific for CPPs. Monoclonal antibodies capable 

20 of blockixig, or neutralizmg, CPPs are selected by their ability to inhibit a biological activity of 
CPPs. 

The use of antibody firagments is also well known, e.g. Fab firagments: Tijssen, Practice and 
Theory of Enzyme Immunoassays (Elsevier, Amsterdam, 1985); and Fv firagmmts: Hochman et al. 
Biochemistry, Vol. 12, pgs. 1130-1135 (1973), Sharon et al.. Biochemistry, Vol. 15, pgs. 1591-1594 

25 (1976) and Ehrlich et al., U.S. Patart 4,355,023; and antibody half molecules: Auditore- 
Hargreaves, U.S. Patent 4,470,925 . 

Preferably, rnonoclonal antibodies, Fv fi:agments. Fab fragments, or other binding 
compositions derived from monoclonal antibodies of the invention have a high aflSnity to CPPs. The 
afiBnity of monoclonal antibodies and related molecules to CPPs may be measured by conventional 

30 techniques including plasmon resonance, ELISA, or equilibrium dialysis. AflSnity measuranent by 
plasmon resonance techniques may be carried out, for example, using a BIAcore 2000 instrument 
(Biacore AB, Uppsala, Sweden) in accordance wilh the manufecturer's recommended protocol. 
Preferably, afiBnity is measured by ELISA, for example, as described in U.S. patent 6,235,883, or 
like reference. Preferably, the dissociation constant betwem CPPs and monoclonal antibodies of the 

35 invention is less than 10"^ molar. More preferably, such dissociation constant is less than lO'* molar; 
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still moie preferably, such dissociation constant is less than 10'^ molar; and most preferably, such 

dissociation constant is in the range of 10"^ to 10'" molar. 

In addition, ih& antibodies of the present invration are usefid fer detecdng CPPs. Such 
detection methods are advantageously ^plied to diagnosis of cardiovascular disorders, in particular, 
5 coronary artery disease. The antibodies of the invention may be used in most assays involving 
antigen-antibody reactions. The assays may be homogeneous or heterogeneous. In a homogeneous 
assay approach, the sainple can be a biological sample or fluid such as serum, urine, whole blood, 
lymphatic fluid, plasma, saliva, cells, tissue, and material secreted by cells or tissues cultured in 
vitro. The sanq)le can be pretreated if necessary to remove imwanted materials. The immimological 
10 reaction usually involves the specific antibody, a labeled analyte, and the sample suspected of 
containing the antigen. The antigen can be directly labeled with any label group described herein. 
The signal arising from the label is modified, directly or indirectly, upon the binding of the antibody 
to the labeled analyte. Both immunological reaction and detection of the extent thereof are carried out 
in a homogeneous solution. Immunochemical labels vMch may be employed include firee radicals, 
15 fluorescent dyes, enzymes, bacteriophages, coenzymes, and so forth. 

In a heterogeneous assay sqpproach, the reagents are usually the sample, the specific 
antibody, and means for producing a detectable signal. The specimen is generally placed on a 
support, such as a plate or a slide, and contacted with the antibody in a liquid phase. The support is 
then s^arated fiom the hquid phase and either the support phase or the liquid phase is examined for 
20 a detectable signal enq)loying means for producing such signal or signal producing system. The 

signal is related to &e presence of the antigen in the sample. Means for producing a detectable signal 
includes the use of radioactive labels, fluorescent conipounds, enzymes, and so forth. Exemplary 
heterogeneous immunoassays are the radioimmunoassay, immunofluorescence methods, enzyme- 
linked inxmunoassays, and the like. 
25 For a more detailed discussion of the above immunoassay techniques, see "Enzyme- 

Immunoassay," by Edward T. Maggio, CRC Press, Inc., Boca Raton, Fla., 1980. See also, for 
example, U.S. Pat. Nos. 3,690,834; 3,791,932; 3,817,837; 3,850,578; 3,853,987; 3,867,517; 
3,901,654; 3,935,074; 3,984,533; 3,966,345; and 4,098,876, which listing is not intended to be 
exhaustive. Methods for conjugating labels to antibodies and antibody fragments are well known in 
30 the art. Such methods may be found in U.S. Pat. Nos. 4,220,450; 4,235,869; 3,935,974; and 

3,966,345. Another example of a technique in which the antibodies of the invention may be employed 
is immunpperoxidase labeling. (Stemberger, Immunocytochemistry (1979) pp. 104-169). 
Alternatively, the antibodies may be bound to a radioactive material or to a drug to form a 
radiopharmaceutical or pharmaceutical, respectively. (Carrasquillo, et al.. Cancer Treatment Reports 
35 (1984) 68:317-328). 
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One embodiment of an assay enq>loymg an antibody of the present invention involves the tisc 
of a surfece to whidi the monoclonal antibody of the invention is attached. The underlying structure 
of the sur&ce may take different forms, have different compositions and may be a mixture of 
compositions or laminates or combinations thereof. Hie sur&ce may assume a variety of shapes and 
-forms and may have varied dimensions, depending on the manner of use and measurement. 
Illustrative surfeces may be pads, beads, discs, or strips which may be flat, concave or convex. 
Thickness is not critical, generally being firom about 0. 1 to 2 mm thick and of any convenient 
diameter or other dimensions. The sinfece typically will be supported on a rod, tube, capillary, 
fiber, strip, disc, plate, cuvette and will typically be porous and polyfunctional or capable of being 
polyfimctionalized so as to permit covaloat binding of an antibody and permit bonding of other 
compounds which form a part of a means for producing a detectable signal. A wide variety of 
organic and inorganic polymers, both natural and synthetic, and combinations thereof, may be 
employed as the niaterial for the solid surfece. Illustrative polymers include polyethylene, 
polypropylene, poly(4-methylbutene), polystyrene, polymethraciylate, poly(e<hylene terephthalate), 
rayon, nylon, poly(vinyl butyrate), silicones, polyfonnaldehyde, cellulose, cellulose acetate, 
nitrocellulose, and latex. Other surfeces include paper, glasses, ceramcs, metals, metaloids, 
semiconductor materials, cements, silicates or the like. Also included are substrates that form gels, 
gelatins, lipopolysaccharides, silicates, agarose and polyacrylamides or polymers which form several 
aqueous phases such as dextrans, polyalkylene glycols (alkylene of 2 to 3 carbon atoms) or 
surfectants such as phospholipids. The bmding of the antibody to the surfece may be accomplished 
by well known techniques, commonly available in the literature (see, for example, *lmmobili^ 
En^es," Ichiro Chibata, Press, New York (1978) and Cuatrecasas, J, Bio. Chem., 245: 3059 
(1970)). In carrying out the assay in accordance with this aspect of the invention, the sample is 
mixed with aqueous medium and the medimn is contacted with the surfece having an antibody bound 
thereto. Labels may be included in the aqueous medium, either concurrently or added subsequently 
so as to provide a detectable signal associated with the surfece. The means for producing the 
detectable signal can involve the incorporation of a labeled analyte or it may involve the use of a 
second monoclonal antibody having a label conjugated thereto. Separation and washing steps will be 
carried out as needed. The signal detected is related to the presence of CPP in the sample. It is within 
the scope of the present invration to include a calibration on the same support. A particular 
embodiment of an assay in accordance with the present invention, by way of illustration and not 
limitation, involves the use of a support such as a slide or a well of a petri dish. The technique 
involves fixing the sample to be analyzed on the support with an appropriate fixing material and 
incubating the saraple on the slide with a monoclonal antibody. After washing witii an appropriate 
buffer such as, for exan5)le, phosphate buffered saline, Ihe support is contacted with a labeled 
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specific binding partner for the antibody. After incubation as desired, the slide is washed a second 

tune with an aqueous bufi^ and tiie determination is made of the binding of the labeled monoclonal 

antibody to the antigen. If tiie label is fluorescent, the slide may be covered with a fluorescent 

antibody mounting fluid on a cover slip and then examined vnth a fluorescent microscope to 

5 determine the extent of binding. On the other hand, the label can be an enzyme conjugated to the 
monoclonal antibody and the extent of binding can be determined by examining the slide for the 
presence of enzyme activity, vMch may be indicated by the formation of a precipitate, color, etc, A 
particular example of an assay utilizing the present antibodies is a double determinant ELISA assay. 
A support such as, e.g., a glass or vinyl plate, is coated with an antibody specific for CPP by 

10 conventional techniques. The support is contacted with the sample suspected of containing CPP, 
usually in aqueous medium. After an incubation period fi"om 30 seconds to 12 hours, the support is 
separated firom the medium, washed to remove unboimd CPP with, for exainple, water or an aqueous 
buffered medium, and contacted with an antibody specific for CPP, again usually in aqueous 
medium. The antibody is labeled with an enzyme directly or indirectly such as, e.g., horseradish 

15 peroxidase or alkaline phosphatase. Aft^ incubation, the support is separated firom the medium, and 
washed as above. The enzyme activity of the support or the aqueous medium is determined. This 
enzyme activity is related to the amount of CPP in the sanq)le. 

The invention also includes kits, e.g., diagnostic assay kits, for carrying out the methods 
disclosed above. la one embodiment, the kit conqirises in packaged combination (a) a monoclonal 

20 antibody more specifically defined above and (b) a conjugate of a specific binding partner for the 
above monoclonal antibody and a label capable of producing a detectable signal. The reagents may 
also include ancillary agents such as buffering agents and proteia stabilizing agents, e.g., 
polysaccharides and the like. The kit may finlher include, where necessary, ofbsr members of the 
signal producing system of which system the label is a member, agents for reducing backgroxmd 

25 interference in a test, control reagents, apparatus for cmducting a test, and the like. la another 

embofiiment, the diagnostic kit comprises a conjugate of monoclonal antibody of the invention and a 
label capable of producing a detectable signal. Ancillary agents as mentioned above may also be 
present. 

Fxuther, an anti-CPP antibody (e.g., monoclonal antibody) can be used to isolate CPPs by 
30 standard techniques, such as affibaity chromatography or inunimoprecipitation. For example, an anti- • 
CPP antibody can fiicilitate the pxnification of natural CPPs firom cells and of recombinandy 
produced CPP expressed in host cells. Moreover, an antirCPP antibody can be used to isolate CPP to 
aid ia detection of low concentrations of CPP (e.g., in plasma, cellular lysate or cell supernatant) or 
in order to evaluate the abundance and pattern of expression of the CPP. Anti-CPP antibodies can be 
35 used diagnostically to monitor protem levels in tissue as part of a clinical testing procedure, e.g., to 
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detennine the efficacy of a given treatment regunen. 
physically linking) the antibody to a label groiq). 
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Detection can be fiicilitated by coupling (i.e.. 



Protein Arrays 

5 Detection, purification, and screening of the polypeptides of the invention may be 

accomplished using retentate chromatography (preferably, protein arrays or chips), as described by 
U.S. Patent 6225027 and U.S. Patent Application 20010014461, disclosures of which are herein 
incoiporated by reference in their entireties. Briefly, retentate chromatography describes methods in 
which polypeptides (and/ or other sample components) are retained on an adsorbent (e.g., array or 

10 chip) and subsequently detected. Such methods involve (1) selectively adsorbing polypeptides from a 
sample to a substrate under a plurality of different adsorbent/eluant combinations ("selectivity 
conditions") and (2) detecting the retention of adsorbed polypeptides by desorption spectrometry 
(e.g., by mass spectrometry). In conventional chromatographic methods, polypeptides are eluted off 
of the adsorbent prior to detection. The coupling of adsorption chromatography with detection by 

15 desorption spectrometry provides extraordinary sensitivity, the ability to rapidly analyze retained 
conq)onents with a variety of dififeient selectivity conditions, and parallel processiiig of components 
adsorbed to different sites (i.e., "afBnily sites" or "spots") on the array under different elution 
conditions. 

These methods are useful for. combinatorial, biochemical separation and purification of the 
20 CPPs; study of differential gene ^ression; detection of differences in protein levels (e.g., fijr 

diagnosis); and detection of molecular recognition events (e.g., fi)r screening and dmg discovery). 
Thus, this invention provides a molecular discovery and diagnostic device that is characterized by the 
inclusion of both parallel and multiplex polypeptide processing capabilities. Polypeptides of the 
invention and CPP-binding substances are preferably attached to a label group, and thus directly 
25 detected, enabling simultaneous transmission of two or more signals from the same "circuit" (i.e., 
addressable "chip" location) during a single unit operation. 

Detection of CPPs by mass spectrometry 

In accordance with the present invention, any instrument, method, process, etc. can be 
30 utilized to determine the identity and abundance of proteins in a sample. A preferred method of 

obtaining identity is by mass spectrometry, where protein molecules in a sample are ionized and then 
the resultant mass and diarge of the protein ions are detected and determined. 

To use mass spectrometry to analyze proteins, it is preferred that the protein be converted to 
a gas-ion phase. Various methods of protein ionization are usefiil, including, e.g., &st ion 
35 bombardment (FAB), plasma desorption, laser desorption, thermal desorption, preferably, 
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electrospray ionization (ESI) axul matrix-^sisted laser desoiptionAonization (MALDI). Many 
different mass analyzers are available for peptide and protein analysis, including, but not limited to, 
Time-of-Flight (TOF), ion trap (TTMS), Fourier transform ion cyclotron (FTMS), quadrupole ion 
trap, and sector (electric and/or magnetic) spectrometers. See, e.g., U.S. Pat. No. 5,572,025 for an 
5 ion-trap MS. Mass analyzers can be used alone, or in combination with other mass analyzers in 

tandem mass spectrometers. In the latter case, a first mass analyzer can be use to separate the protein 
ions (precursor ion) firom each other and determine the molecular weights of the various protein 
constituents in the sample. A second mass analyzer can be used to analyze each separated 
constituents, e.g., by firagmenting the precursor ions into product ions by using, e.g. an inert gas. 
10 Any desired combination of mass analyzers can be used, including, e.g., triple quadrupoles, tandem 
time-of-flights, ion traps, and/or combinations thereof. 

Different kinds of detectors can be used to detect the protein ions. For example, destructive 
detectors can be utilized, such as ion electron multipUers or cryogeioic detectors (e.g., U.S. Pat. No. 
5,640,010). Additionally, non-destructive detectors can be used, such as ion traps which are used as 
15 ion curr^ pick-up devices in quadrupole ion trap mass analyzers or FTMS. 

For MALDI-TOF, a number of sant^le preparation methods can be utilized including, dried 
droplet (KarasandHBllenkamp, Anal. Chem., 60:2299-2301, 1988), vacuum-drying (Winberger et 
al.. In Proceedings of the 41st ASMS Conference on Mass Spectrometry and Allied Topics, San 
Francisco, May 31-June 4, 1993, pp. 775a-b), crush crystals (Xiang et al.. Rapid Comm. Mass 
20 Spectrom., 8:199-204,1994), slow crystal growing (Xiang et al.. Org. Mass Spectrom, 28:1424- 
1429, 1993); active fihn (Mock et al.. Rapid Comm. Mass Spectrom.,6:233-238, 1992; Bai et al.. 
Anal. Chem., 66:3423-3430, 1994), pneumatic spray (Kochling et al,. Proceedings of the 43rd 
ASMS Conference on Mass Spectrometry and Allied Topics; Atlanta, GA, May 21-26, 1995, 
pl225); electrospray (Hensel et al.. Proceedings of the 43rd ASMS Conference on Mass 
25 Spectrometry and Allied Topics; Atlanta, GA, May 21 -26, 1995, p947); fest solvent evaporation 
(Vorm et al., Anal. Chem., 66:3281-3287, 1994); sandwich (Li et al., J. Am, Chem. Soc, 11 
8:11662-11663,1996); and two-layer methods (Dal et al.. Anal. Chem., 71:1087-1091, 1999). See 
also, e.g., Liang et al.. Rapid Commun. Mass Spectrom., 10: 1219-1226, 1996; van Adrichemet al.. 
Anal. Chem., 70:923-930, 1998. 
' 30 For MALDI analysis, samples are prepared as solid-state co-crystals or thin films by mixing 

them with an energy absorbing compound or colloid (the matrix) in the.Uquid phase, and ultimately 
drying the solution to the soUd state upon the sur&ce of an inert probe. In some cases an energy 
absorbing molecule (EAM) is an integral conq)onent of the sample presenting sur&ce. Regardless of 
EAM application strategy, the probe contents are allowed to dry to the solid state prior to 
35 mtroduction into the laser desorptionAonization time-of-£Ught mass spectrometer (LDIMS). 
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Ion detection in TOP mass spectrometry is typically achieved with the use of electro- 
emissive detectors such as electron multipliers (EMP) or microchannel plates (MCP). Both of tiiese 
devices fimction by coovertmg primary incident charged particles into a cascade of secondary, 
tertiary, quaternary, etc. electrons. Hie probability of secondary electrons being generated by the 

5 impact of a single incident charged particle can be taken to be the ion-to-electron conversion 
efficiency of this charged particle (or more simply, the conversion efficiency). The total electron 
yield for cascading events when compared to the total number of incident charged particles is 
typically described as the detector gain. Because generally the overall response time of MCPs is far 
superior to that of EMPs, MCPs are the preferred electro-emissive detector for enhanciag 

10 mass/charge resolving power. However, EMPs function well for detecting ion populations of 
disbursed kinetic energies, where rapid response time and broad firequracy bandwidth are not 
necessary. 

In a preferred aspect, for the analysis of digested proteins, a Uquid-chromatography tandem 
mass spectrometer (LC-TMS) is used. This system provides an additional stage of sanq)le separation 

1 5 via use of a liquid chromatograph followed by tandem mass spectrometry. 

In preferred aspects, a protein eluted from a column according to the systKn descril^ 
Exaniple 1 is analyzed using both MS and MS-MS analysis. For exaiiq)le, a small portion of intact 
proteins eluting from RP2 may be diverted to online detection using LC-ESI MS. The proteins are 
aliquoted on a number of plates allowing digestion or not with trypsin, preparation for MALDI-MS 

20 as well as for ESI-MS, as well as preparation of the MALDI plates with dififerent matrices. The 

methods thus allow, in addition to information on intact mass, to conduct an analysis by botiii peptide 
mass fingerprinting and MS-MS techniques. 

The methods described herein of separating and fractionating proteins provide individual 
proteins or fractions containing small numbers of distinct proteins. These proteins can be identified 

25 by mass spectral determination of the molecular masses of the protein and peptides resulting firom the 
fragmentation thereof Making use of available information in protein sequence databases, a 
comparison can be made between proteolytic peptide mass patterns generated in silica, and 
e?q)erimentally observed peptide masses. A "hit-list" can be compiled, ranking candidate proteins in 
the database, based on (among other criteria) the number of matches between the theoretical and 

30 experimental proteolytic fragments. Several Web sites are accessible that provide software for 
protein identification on-line, based on peptide mapping and sequence database search strategies 
(e.g., http://www.expasy.ch). Methods of peptide mapping and sequencing using MS are described in 
WO 95/252819, U.S. Pat. No. 5,538,897, U.S. Pat. No. 5,869,240, U.S. Pat. No. 5,572,259, and 
U.S. Pat. No. 5,696,376. See, also, Yates, J. Mass Spec., 33:1 (1998). 

35 Data collected from a mass spectrometer typically comprises the intensity and mass to 

61 



wo 2004/061096 PCT/EP2004/000027 
charge ratio for each detected event. Spectral data can be recorded in any suitable fonn, including, 
e.g., in graphical, numerical, or electronic formats, either in digital or analog form. Spectra are 
preferably recorded in a storage medium, including, e.g., magnetic, such as floppy disk, tape, or hard 
disk; optical, such as CD-ROM or laser-disc; or, ROM-CHIPS. 

The mass spectrum of a given sample typically provides information on protein intensity, 
mass to charge ratio, and molecular weight. In preferred embodiments of the invention, the molecular 
weights of proteins in the sample are used as a matching criterion to query a database. The molecular 
wei^ts are calculated conventionally, e.g., by subtracting the mass of the io nizing proton for singly- 
charged protonated molecular ions, by multiplying the measxured mass/charge ratio by the number of 
charges for multiply-charged ions and subtracting the number of ionizing protons. 

Various databases are useful in accordance with the present invention. Useful databases 
include, databases containing genomic sequences, expressed gene sequences, and/or expressed 
protein sequences. Preferred databases contain nucleotide sequence-derived molecular masses of 
protems present in a known organism, organ, tissue, or cell-type. There are a number of algorithms 
to identify open reading frames (ORF) and convert nucleotide sequences into protem sequence and 
molecular weight information. Several publicly accessible databases are available, including, tiie 
SwissPROT/TrEMBL database Cht1p://www.expasy.ch). 

Typically, a mass spectrometer is equipped wifli commercial software that identifies peaks 
above a certain threshold level, calculates mass, dbarge, and intensity of detected ions. Correlating 
molecular weight with a given output peak can be accQnq>lished directiy from the spectral data, i.e., 
where the charge on an ion is one and the molecular weight is therefore equal to the numerator value 
minus the mass of the ionizing proton. However, protein ions can be complexed with various 
counter-ions and adducts, such as N, C, and K\ In such a case, it would be expected that a given 
protein ion would exhibit multiple peaks, such as a triplet, representing different ionic states (or 
species) of the same protein. Thus, it may be necessary to analyze and process spectral data to 
determine femilies of peaks arising from the same protein. This analysis can be carried out 
conventionally, e.g., as described by Mann et al., anal. Chem., 61:1702-1708 (1989). 

In matching a molecular mass calculated from a mass spectrometer to a molecular mass 
predicted from a database, such as a genomic or expressed gene database, post-translation 
processing may have to be considered. There are various processing events which modify protein 
structure, including, proteolytic processmg, removal of N-terminal methionine, acetylation, 
methylation, glycosylation, phosphorylation, etc. 

A database can be queried for a range of proteins matching die molecular mass of the 
unknown. The raiige window can be determined by the accuracgr of the mstrument, the method by 
which the sample was prepared, etc. Based on the number of hits (where a hit is matdi) in the 
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spectrum, the uoknown protein or peptide is identified or classified. 

Methods of identifying one or more CPP by mass spectrometry are useful for diagnosis and 
prognosis of cardiovascular disorders. Preferably, such methods are used to detect one or more CPP 
present in himian plasma. Exemplary techniques are described in U.S. Patent Applications 
5 02/0060290, 02/0137106, 02/0138208, 02/0142343, 02/0155509, disclosures of which are 
incorporated by reference in their entireties. 

Diagnostic and Prognostic Uses 

The nucleic acid molecules, proteins, protein homologues, and antibodies described herein 

10 can be used in one or more of the following methods: diagnostic assays, prognostic assays, 
monitoring clinical trials, screening assays, and pharmacogenetics as further described hereia. 

The invention provides diagnostic and prognostic assays for detecting CPP nucleic acids and 
proteins, as finilier described. Also provided are diagnostic and prognostic assays for detecting 
interactions betweea CPPs and CPP target molecules, particularly natural agonists and antagonists. 

15 The present invention provides methods for identifying polypeptides that are differentially 

expressed between two or more samples. 'Differential e3q)ression" refers to differences in the 
quantity or quality of a polypeptide between saoqples. Such differences could result at any stage of 
protein expression fiom transcription througjh post-translational modification. For example, using 
protein array methods, two sa]x^)les are bound to affinity spots on different sets of adsorbents (e.g., 

20 chips) and recognition maps are compared to identify polypeptides that are differentially retained by 
the two sets of adsorbents. Differential retention includes quantitative retention as well as qualitative 
differences in the polypeptide. For exainple, differences in post-translational modification of a 
protein can result in differences in recognition maps detectable as differences in binding 
characteristics (e.g., glycosylated proteins bind differently to lectin adsorbents) or differences in 

25 mass (e.g., post-translational cleavage products). In certain embodiments, an adsorbent can have an 
array of affinity spots selected for a combination of markers diagnostic for a disease or syndrome. 

Differences in polypeptide levels between samples (e.g., differentially expressed CPPs in 
plasma samples) can be idratified by e:q)osing the samples to a variety of conditions for analysis by 
desorption spectrometry (e.g., mass spectrometry). Unknown proteins can be identified by detecting 

30 physicochemical characteristics (e.g., molecular mass), and this information can be used to search 
databases for proteins having similar profiles. 

Preferred methods of detecting a CPP utilize mass spectrometry techniques. Such methods 
provide information about the size and character of the particular CPP isoform that is present in a 
sample, e.g., a biological sample submitted for diagnosis or prognosis. Mass spectrometry techniques 

35 are detailed in the section titled *^tection of CPPs by mass spectrometry". Example 1 outlines a 
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preened detection scheme, in^erein a biological sample is separated by chromatography before 
characterization by mass spectrometry. The invention provides a method of detecting a CPP in a 
biological sample comprising the steps of: fractionating a biological sanople (e.g., plasma, serum, 
lymph, cerebrospiaal fluid, cell lysate of a particular tissue) by at least one chromatographic step; 

5 subjecting a fraction to mass spectrometry; and comparing the characteristics of polypeptide species 
observed in mass spectrometry with known characteristics of CPP polypeptides (e.g., CPP 6-7, as 
disclosed in Figure 1). • 

The isolated nucleic acid molecules of the invention can be used, for example, to detect CPP 
mRNA (e.g., in a biological sample) or a genetic alteration in a CPP-encoding gene, and to modulate 

10 a CPP activity, as described further below. The CPP can be used to treat disorders characterized by 
insufficient production of a CPP or by excessive production of a CPP target molecule. In addition, 
the CPPs can be used to screen for naturally occurring CPP target molecules, to screen for drugs or 
compounds which modulate, preferably activate CPP activity, as well as to treat disorders 
characterized by msufficient production of CPP or production of CPP forms which have decreased 

15 or aberrant activity compared to wild type CPP. Moreover, the anti- CPP antibodies of the invention 
can be used to detect and isolate CPP, regulate the bioavailability of CPP, and modulate CPP 
activity. 

Accordingly one embodiment of the present invention involves a method of use (e.g., a 
diagnostic assay, prognostic assay, or a prophylactic/therapeutic method of treatment) wherein a 

20 molecule of the present invention (e.g., a CPP, CPP nucleic acid, or CPP inhibitor or activator) is 
used, for example, to diagnose, prognose and/or treat a disorder in which any of the aforementioned 
CPP activities is indicated. In another embodiment, the present invention involves a method of use 
(e.g., a diagnostic assay, prognostic assay, or a prophylactic/therapeutic method of treatment) 
wherein a molecule of the present invention is used, for example, for the diagnosis, prognosis, and/or 

25 treatment of subjects, preferably a human subject, in which any of the aforementioned activities is 
pathologically perturbed. In a preferred embodiment, the methods of use involve administering to a 
subject, preferably a human subject, a molecule of the present invention for the diagnosis, prognosis, 
and/or therapeutic treatment In another embodiment, the methods of use involve administering to a . 
human subject a molecule of the present invention. 

30 For example, the invention encompasses a method of determining whether a CPP is 

expressed within a biological sample comprising: a) contacting said biological sample with: i) a 
polynucleotide that hybridizes under stringent conditions to a CPP nucleic add; or ii) a detectable 
polypeptide (e.g. antibody) that selectively binds to a CPP; and b) detecting the presence or absence 
of hybridization between said polynucleotide and an RNA species wilfain said sanq)le, or the presence 

35 or absence of bindmg of said detectable polypeptide to a polypeptide within said sample. Detection 
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of said hybridizadon or of said binding indicates that said CPP is expressed within said sample. 
Preferably, the polynucleotide is a primer, and wherein said hybridization is detected by detecting the 
presence of an amplification product comprising said primer sequence, or the detectable polypeptide 
is an antibody. 

5 In certain embodim^ts, detection involves Hhe use of a probe/primer in a polymerase chain 

reaction (PGR) (see, e.g., U.S. Pat. Nos. 4,683,195 and 4,683,202, the disclosures of which are 
incorporated herein by reference in their entireties), such as anchor PGR or RACE PGR, or, 
attematively, in a ligation chain reaction (LGR) (see, e.g., Landegren et al. (1988) Science 241:1077- 
1080; andNakazawa et al. (1994) PNAS 91:360-364, tiie disclosures of which are incorporated 
10 herein by reference in their entireties), the latter of which can be particularly usefiil for detecting 
point mutations in the CPP-encoding-gene (see Abravaya et al. (1995) Nucleic Acids Res. 23:675- 
682, the disclosure of which is incorporated herein by reference in its entirety). 

Also envisioned is a method of determining whether a mammal, preferably hu^ian, has an 
elevated or reduced level of ejqpression of a GPP, comprising: a) providing a biological sample from 
15 said mammal; and b) comparing the amount of a CPP or of a GPP KNA species encoding a GPP 
within said biological saraple with a level detected m or expected from a control sample. An 
increased amount of said GPP or said GPP RNA species within said biological sample compared to 
said level detected in or expected from said control sanqple indicates that said mammal has an 
elevated level of GPP expression, and a decreased amount of said GPP or said GPP RNA species 
20 within said biological sample compared to said level detected in or expected from said control sample 
indicates that said mammal has a reduced level of expression of a GPP. 

The present invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, and monitoring clinical trials are used for prognostic purposes to thereby 
treat an individual prophylacticeilly . Accordingly, one aspect of the present invention relates to 
25 diagnostic assays for determining GPP and/or nucleic acid expression as well as GPP activity, in the 
context of a biological sample (e.g., blood, plasma, cells, tissue) to thereby determine whether an 
individual is afflicted with a disease or disorder, or is at risk of developing a disorder, associated 
with aberrant GPP expression or activity. The invention also provides for prognostic (or predictive) 
assays for determining vsdiether an individual is at risk of developing a disorder associated with a 
30 GPP, nucleic acid expression or activity. For example, noutations in a GPP-encoding gene can be 
assayed in a biological san^le. Such assays can be used for prognostic or predictive purpose to 
thereby prophylactically treat an individual prior to &e onset of a disorder characterized by or 
associated with GPP expression or acdvity. 

The term "biological sample" is intended to include tissues, cells and biological fluids 
35 isolated Scata an individual, as well as tissues, cells and fluids present within an individual. That is, 
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the detection methods of the inventiou can be used to detect a CPP mRNA, protein, or genomic DNA 
in a biological sanople in vitro as well as in vivo. Preferred biological samples are biological fluids 
such as lymph, cerebrospinal fluid, blood, and especially blood plasma. For example, in vitro 
techniques for detection of a CPP mRNA include Northern hybridizations and in situ hybridiTations. 
In vitro techniques for detection of a CPP include mass spectrometry. Enzyme Linked Immuno 
Sorbent Assays (ELIS As), Western blots, immunoprecipitations and immunofluorescence. In vitro 
techniques for detection of a CPP-^coding genomic DNA include Southern hybridizations. 
Furthermore, in vivo techniques for detection of a CPP include introducing into an individual a 
labeled anti- CPP antibody. 

In preferred embodiments, the subject methods can be characterized by generally conq)rising 
detecting, in a tissue samplo of the individual (e.g. a human patient), the presence or absence of a 
gmetic lesion characterized by at least one of (i) a mutation of a gene encoding one of the subject 
CPP or (ii) the mis-ejq)ressiQn of a CPP-^ncoding gene. To illustrate, such genetic lesions can be 

• detected by ascertaining the existence of at least one of (i) a deletiOTi of one or more nucleotides from 
the CPP-encoding gene, (ii) an addition of one or more nucleotides to the gene, (iii) a substitution of 
one or more nucleotides of the gene, (iv) a gross chromosomal rearrangement or amplification of llie 
gene, (v) a gross alteration in the level of a messenger KNA transcript of the gene, (vi) aberrant 
modification of the gene, such as of the methylation pattern of the genomic DNA, (vii) the presence 
of a non-wild type splicing pattern of a messenger RNA transcript of the gene, and (viii) reduced 
level of e2q>res5ion, indicating lesion in r^latory element or reduced stability of a CPP-encoding 
transcript. 

In yet another exemplary embodiment, aberrant methylation patterns of a CPP nucleic acid 
can be detected by digesting genomic DNA from a patient sample with one or more restriction 
endonucleases that are sensitive to methylation and for which recognition sites exist in the CPP- 
encoding gene (including in the flanking and intronic sequences). See, for example. Suiting et al. 
(1994) Human Mol Genet 3:893-895. Digested DNA is separated by gel electrophoresis, and 
hybridized with probes derived from, for example, genomic or cDNA sequences. The methylation 
status of the CPP-encoding gene can be determined by comparison of the restriction pattern 
generated from the sample DNA with that for a standard of known methylation. 

In yet another embodiment, a diagnostic assay is provided whidi detects the ability of a CPP 
to bind to a cell sur&ce or extracellular protein. For instance, it will be desirable to detect CPP * 
mutants which, while expressed at i^preciable levels in the cell, are defective at binding a CPP target 
protein Qiaving either diminished or enhanced binding afiEuuty for the target). Sudi mutants may 
arise, for example, from mutations, e.g., point mutants, whidi may be impractical to detect by the 
diagnostic DNA sequencmg techniques or by the immunoassays described above. Hie present 
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mventton accordingly fiirfher CQntectplates diagnostic screaung assays which generally comprise 

cloning one or more CPP-encoding gene from the sample tissue, and expressing the cloned genes 

under conditions which permit detection of an interaction between that recomibinant gene product and 

a target protein. As will be apparmt from the description of the various assays set forth herein under 

5 the Section entitled "Drug screening assays", a wide variety of techniques can be used to determine 
the ability of a CPP to bind to other conaponents. These techniques can be used to detect mutations in 
a CPP-encoding gene which give rise to mutant proteins with a higher or lower binding afBnity for a 
CPP target protein relative to the wild-type CPP. Conversely, by switching which of the CPP target 
protein and CPP is the "bait" and which is derived from the patient sample, the subject assay can 

10 also be used to detect CPP target protein mutants which have a higher or lower binding affinity for a 
CPP relative to a wild type form of that CPP target protein. 

In an exen^lary embodiment, a target protein can be provided as an immobilized protein (a 
"taiget"), such as by use of GST frision proteins and glutathione treated microtiter plates as 
described herein. 

15 In another embodiment, the methods frirther involve obtaining a control biological sample 

from a control subject, contacting the control sample with a compound or agent capable of detecting 
a CPP, mRNA, or genomic DNA, such that the presence of a CPP, mRNA or genomic DNA is 
detected in the biological san[q)le, and comparing Ifae presence of a CPP, mRNA or genomic DNA in 
the control sanqile with the presence of a CPP, mRNA or genomic DNA in the test sample. The 

20 invention also enconq)asses kits for detecting the presmce of a CPP, mRNA or genomic DNA in a. 
biological sample. For example, the kit can comprise: a labeled compound or agent capable of 
detecting a CPP, ntiRNA or genomic DNA in a biological sample; means for detennining the amount 
of a CPP in the sample; and means for comparing the amount of CPP in the sample with a standard. 
The compound or agent can be packaged in a suitable container. The kit can further comprise 

25 instructions for usiog the kit to detect CPP or nucleic acid. 



Drug screening assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying candidate modulators (e.g., small molecules, peptides, antibodies, peptidomimetics or 

30 other drugs) which bind to CPPs, have an inhibitory or activating effect on, for example, CPP 

expression or preferably CPP biological activity, or have an inhibitory or activating effect on, for 
example, the activity of a CPP target molecule. In some embodiments small molecules can be 
generated using combinatorial dhemistry or can be obtained from a natural products library. Assays 
may be ceQ based or non-cell based assays. Dmg screening assays may be binding assays or more 

35 preferentially functional assays^ as further described. 
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When the inventiaa is used for drug development, e.g., to determine the abiUty of a CPP 
polypeptide or candidate modulator to induce an anti-cardiovascular disorder response, the body 
fluid analyzed for the level of at least one CPP is preferably from a non-human mammal. The non- 
human mammal is preferably one in whidi the induction of an anti-cardiovascxilar disorder response 
5 by endogenous and/or exogenous agents is predictive of the induction of such a response in a human. 
Rodents (mice, rats, etc.) and primates are particularly suitable for use in this aspect of the 
invention. 

Agents that are found, using screening assays as further described herein, to modulate CPP 
activity by at least 5%, more preferably by at least 10%, still more preferably by at least 30%, still 
10 more preferably by at least 50%, still more preferably by at least 70%, even more preferably by at 
least 90 %, may be selected for further testing as a prophylactic and/or therapeutic anti- 
cardiovascular disease agent. 

Agents that are found to increase CPP activity may be used, for example, to reduce the 
synqptoms of a cardiovascular disorder alone or in combination with other £^propriate agents' or 
15 treatments. Agents found to inhibit CPP activity may be used, for example, to modulate treatment 
rogunens for cardiovascular disorders. 

In another aspect, agents that are found, using screening assays as further described herein, 
to modulate CPP e>q)ression by at least 5%, more preferably by at least 10%, still more preferably 
by at least 30%, still more preferably by at least 50%^ still more preferably by at least 70%, even 
20 more preferably by at least 90 %, may be selected for further testing as a prophylactic and/or 
therapeutic anti-cardiovascular disease agent. 

Protein array methods are useful for screening and drug discovery. For example, one 
member of a receptor/ ligand pair is docked to an adsorbent, and its abiUty to bind the binding 
partner is determined in the presence of the test substance. Because of the r^idity with which 
25 adsorption can be tested, combinatorial libraries of test substances can be easily screened for their 
abiUty to modulate the interaction. In preferred screening methods, CPPs are docked to the 
adsorbent. Binding partners are preferably labeled, thus enabling detection of the interaction. 

Alternatively, in certain embodiments, a test substance is docked to the adsorbent. The 
polypeptides of the invention are exposed to the test substance and screened for binding. Preferred 
30 test substances include substances correlated with a disease or disorder, such as a protein, lipid, or 
endocrine fector differentially present in disease (preferably, a cardiovascular disease). 

In other embodiments, an assay is a cell-based assay in which a cell which expresses a CPP 
or biologically active portion thereof is contacted with a test compound and the ability of tiie test 
compound to inhibit, activate, or increase CPP activity determined. Determining the ability of the test 
35 compound to inhibit, activate, or increase CPP activity can be accomplished by monitoring the 
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bioactivity of the CPP or biologically active portion thereof. The cell, for example, can be of 
mammalian origin, insect origin, bacterial origin or a yeast cell. 

In one embodiment, the invention provides assays for screening candidate or test ccmpoimds 
which are target molecules of a CPP or biologically active portion thereof In another embodiment, 

5 the invention provides assays for screening candidate or test compomids which bind to or modulate 
the activity of a CPP or biologically active portion thereof The test compomids of the present 
invention can be obtained using any of the numerous approaches in combinatorial Ubrary methods 
known in the art, including: biological libraries; spatially addressable parallel solid phase or solution 
phase libraries; synthetic Ubrary methods requiring deconvolution; the 'one-bead one-compound' 

10 library method; and synthetic Ubrary methods using affinity chromatography selection. The 
biological Ubrary approach is used with peptide Ubraries, while the other four approaches are 
appUcable to peptide, non-peptide oUgomer or small molecule Ubraries of compounds (Lam, K. S. 
(1997) Anticancer Drug Des. 12:145, the disclosure of which is incorporated herein by reference in 
its entirety). 

15 Examples of methods for the synthesis of molecular Ubraries can be found in the art, for 

example in: DeWitt et al. (1993) Proc. NaU. Acad. Sd. U.S.A. 90:6909; Erb et al. (1994) Proc. 
Nad. Acad. Sci. USA 91:1 1422; Zuckennann et al. (1994). J. Med. Chem. 37:2678; Cho et al. 
(1993) Science 261:1303; CarreU et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2059 and 2061; and 
in Gallop et al. (1994) J. Med. Chem. 37:1233, Ae disclosures of which are incorporated herein by 
20 reference in their entireties. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) Biotedhniques 
13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor (1993) Nature 364:555- 
556), bacteria (Ladner U.S. Pat. No. 5,223,409), spores (Ladner U.S. Pat. No. '409), plasmids (Cull 
et al. (1992) Proc Natl Acad Sd USA 89:1865-1869) or on phage (Scott and Smith (1990) Science 
25 249:386-390); (Devin (1990) Science 249:404-406); (Cwirla et al. (1990) Proc. Natl. Acad. Sci. 
87:6378-6382); (FeUci (1991) J. Mol. Biol. 222:301-310); (Ladner supra.). 

Determining the abiUty of the test compound to inhibit or increase CPP activity can also be 
accompUshed, for example, by coupling the CPP or biologically active portion thereof with a label 
group sudi that binding of the CPP or biologically active portion thereof to its cognate target 
30 molecule can be determined by detecting the labeled CPP or biologically active portion thereof in a 
complex. For example, the extent of complex formation may be measured by immunoprecipitating 
the complex or by performing gel electrophoresis. 

It is also within tibie scqpe of this invention to determine the abiUty of a coixq}oundto interact 
with its cognate target molecule without the labeUng of any of the interactants. For exaitq)le, a 
35 micrpphysiometer canbe used to detect the interaction of a compound with its cognate target 
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molecule without the labeling of either the compound or tiie target molecule. McConnell, H. M. et al. 
(1992) Science 257:1906-1912, the disclosure of which is incorporated by referrace in its entirety. 
A microphysiometer such as a cytosensor is an analytical instrument that measures the rate at wHch 
a cell acidifies its environment using a Light-Addressable Potentiometric Sensor (LAPS). Changes in 
5 this acidification rate can be used as an indicator of tite interaction between compound and receptor. 

In a preferred embodiment, the assay comprises: contacting a cell which e3q)resses a CPP or 
biologically active portion thereof with a target molecule to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compoimd to inhibit or 
increase the activity of the CPP or biologically active portion thereof Determining the ability of the 
10 test compound to inhibit or increase the activity of the CPP or biologically active portion thereof . 
comprises: detennining the ability of the test compound to inhibit or increase a biological activity of 
the CPP expressiiig cell (e.g., interaction with a CPP target molecule, as discussed above). 

In another preferred embodiment, the assay con:q)rises contacting a cell which is responsive 
to a CPP or biologically active portion thereof with a CPP or biologically active portion thereof to 
15 form an assay mixture, contacting the assay mixture with a test compound, and detennining the 
ability of the test conq)Ound to modulate the activity of the CPP or biologically active portion 
thereof. Detennining the ability ofthe test compound to modulate the activ^ 
biologically active portion thereof conq}rises determining the ability of the test compound to 
modulate a biological activity of the CPP-responsive cell. 
20 In ano&er embodiment, an assay is a cell-based assay comprising contacting a cell 

e3q)ressing a CPP target molecule (i.e. a molecule with which CPPs interact) with a test compound 
and determining the ability of the test compound to modulate (e.g. stimulate or inhibit) the activity of 
the CPP target molecule. Detennining the ability of the test compound to modulate tiie activity of a 
CPP target molecule can be accomplished, for example, by assessing the activity of a target 
25 molecule, or by assessing the ability of the CPP to bind to or interact with the CPP target molecule. 

Detennioing the ability of the CPP to bind to or interact with a CPP target molecule, for 
example, can be accomplished by one of the methods described above for directly or indirectly 
determining binding. In a preferred embodiment, the assay includes contacting the CPP or 
biologically active portion thereof with a known compound which binds said CPP (e.g., a CPP 
30 antibody or target molecule) to form an assay mixture, contacting the CPP with a test compoimd 
before or after said known compound, and determining the ability of the test compound to interact 
with the CPP. Determining the ability of the test compound to interact with a CPP comprises 
detennining the ability of the test conq>ound to preferentially bind to the CPP or biologically active 
portion thereof as compared to the known compound. . Detennining the ability of the CPP to bind to 
35 a CPP .target molecule can also be accomplished using a technology such as real-time Biomolecular 
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Interaction Analysis (BIA). Sjolander, S. and Urbanic2aqr, C. (1991) Anal. Ghent 63:2338-2345 and 
Szabo et al. (1995) Curr. Opin. Struct. Biol. 5:699-705, the disclosures of which are incorporated 
herein by reference in their entireties. As used herein, "BIA" is a technology for studying biospedfic • 
interactions in real time, without labeling any of the interactants (e.g., BIAcore). Changes in the 
optical phenomenon of surface plasmon resonance (SPR) can be used as an indication of real-time 
reactions between biological molecules. 

In another embodiment, the assay is a cell-free assay m which a CPP or biologically active 
portion thereof is contacted with a test compound and the ability of the test compound to modulate 
(e.g., stimulate or inhibit) the activity of the CPP or biologically active portion thereof is detemuned. 
In a preferred embodiment, determiiiing the abihty of the CPP to modulate or interact with a CPP 
target molecule can be accomplished by determining the activity of the target molecule. For example, 
the activity of tiie target molecule can be determined by contacting tiie target molecule with the CPP 
or a fragment thereof and measuring induction of a cellular second messenger of the target (e.g., 
cAMP, STAT3, Akt, intracellular Ca2+, diacylglycerol, IP3, etc.), detecting catalytic/enzymatic 
activity of the target for an appropriate substrate, detecting the induction of a reporter gene 
'.(con9>risiiig a target-responsive regulatory element operatively linked to a nucleic acid encoding a 
detectable marker, e.g., hiciferase), or detecting a target-regulated cellular response, for exanq^le, 
signal transduction or protein:protein interactions. 

The cell-free assays of the present invention are am^iable to use of both soluble and/or 
membrane-bound forms of isolated proteins (e.g. CPPs or biologically active portions thereof or 
molecules to which CPPs targets bind). In the case of cell-free assays in which a membrane-bound 
form an isolated protein is used it may be desirable to utilize a solubilizing agent such that the 
membrane-bound form of the isolated protein is maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, n- 
dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton TM X-100, 
Triton TM X-114, Thesit TM, Isotridecypoly(ethylene glycol ether)n,3-[(3- 
cholamidopropyl)dimethylarriminio]- 1-propane sulfonate (CHAPS), 3-[(3- 

cholaimdopropyl)dimethylanMninio]-2-hydroxy-l-propane sulfonate (CHAPSO), orN-dodecyl=N,N- 
dimethyl-3-ammonio-l-propane sulfonate. 

In more than one embodiment of the above assay methods of the present invention, it may be 
desirable to immobilize eidier a CPP or its target molecule to &cilitat6 separation of complexed from 
uncomplexed forms of one or both of the proteins, as well as to accommodate automation of the 
assay. Binding of a test conqpound to a CPP, or interaction of a CPP with a target molecule in the 
presence and absence of a candidate conq)ound, can be accoirqplished in any vessel suitable for 
containing tiie reactants and by any immobilization protocol desaibed herein. Altematively, the 
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conq)l6xes can be dissociated firom the matrix, and the level of CPP binding or activity detennined 

using standard techniques. 

Other tedmiques for immobilizing proteins on matrices can also be used in the screening . 

assays of the invention. For example, either a CPP or a CPP target molecule can be immobilized 

5 utilizing conjugation of biotin and streptavidin. Biotinylated CPP or target molecules can be 

prepared from biotin-NHS (N-hydroxy-succinimide) using techniques weU known in the art (e.g., 

biotinylation kit. Pierce Chemicals, Rockford, HI.), and immobilized in the wells of streptavidin- 

coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive with CPP or target 

molecules but which do not interfere with binding of the CPP to its target molecule can be 

10 derivatized to the weUs of the plate, and unbound target or CPP trapped in the wells by antibody 

conjugation. Methods for detecting such complexes, in addition to those described above for the 

GST-immobilized complexes, include immunodetection of complexes using antibodies reactive with 

the CPP or taiget molecule, as well as enzyme-linked assays which rely on detecting an en^yncnatic 

activity associated with the CPP or target molecule. ** 

15 In another embodiment, modulators of CPP e?q>ression are identified in a method vdierein a " 

ceU is contacted with a candidate compound and the ^q>ressiQn of CPP mKNA or protein 

is determined. The level of estpression of CPP mRNA or protein in the presence of the candidate 

compound is compared to the level of e7q>ression of CPP mRNA or protein in the absence of the 

candidate con^Knmd. The candidate compound can then be identified as a modulator of CPP 

20 cTqnression based on this conq)arison. For exanq)le, when e?q)ression of CPP mSNA or protein is 

greater (statistically significantly greater) in the presence of the candidate compound liian in its 

absence, the candidate conq)Ound is identified as a stimulator of CPP mRNA or protein expression. 

Altematively, when expression of CPP mKNA or protein is less (statistically significantly less) in the 

presence of the candidate compound than in its absence, the candidate compound is identified as an 

25 inhibitor of CPP mRNA or protein expression. The level of CPP mRNA or protefai expression in the 

cells can be determined by methods described herein for detecting CPP mRNA or protein. 

In yet another aspect of the invention, the CPP can be used as "bait proteins" in a two-hybrid 

assay or three-hybrid assay (see, e.g., U.S. Pat. No. 5,283,317; Zervos et al, (1993) CeU 72:223- 

232; Madura et al. (1993) J. Biol. Chem. 268:12046-12054; Bartel et al. (1993) Biotechniques 

30 14:920-924; Iwabuchi et al. (1993) Oncogene 8: 1693-1696; and Brent WO94/10300, the disclosures 

of which are incorporated herein by reference in their entireties), to identify other proteins, which 

bind to or interact with CPPs ("CPP-binding proteins" or "CPP-bp") and are involved in CPP 

activity. Such CPP-binding proteins are also likely to be involved in the propagation of signals by the 

CPP or CPP targets as, for example, downstream elements of a CPP-mediated signaling pathway. 

35 Ahematively, such CPP-binding proteins are likely to be CPP inhibitors. 
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The two-hybrid system is based on the modular nature of most transcription &ctors, which 

consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two diferent 

DNA constructs . hi one construct, the gene that codes for a CPP or a fiagment thereof is fused to a 

gene encoding the DNA binding domain of a known transcription fector (e.g., GAL-4). In the other 

construct, a DNA sequence, from a Ubrary of DNA sequences, that encodes an unidentified protein 

("prey" or "sample") is fiised to a gene that codes for the activation domain of the known 

transcription factor. If the "bait" and the "prey" proteins are able to interact, in vivo, forming a CPP 

-dependent complex, the DNA-binding and activation domains of the transcription factor are brought 

into close proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) which is 

operably linked to a transcriptional regulatory site responsive to the transcription fector. Expression 

of the reporter geae can be detected and cell colonies containing the functional transcription fector 

can be isolated and used to obtain the cloned gene which encodes the protein which interacts with the 

CPP. 

This invention further pertains to novel agents identified by the above-described screening 
assays and to processes for producing such agents by use of these assays. Accordingly, in one 
embodiment, the present invention includes a confound or agent obtainable by a method comprising 
the steps of any one of the aforementioned screening assays (e.g., cell-based assays or cell-free 
assays). 

Accordingly, it is within the scope of this invention to further use an agent identified as 
described herein in an appropriate animal model. For example, an agent identified as described herein 
(e.g., a CPP mo^ila*i"g agent, or a CPP -binding partner) can be used in an animal model to 
determine the efficacy, toxicity, or side effects of treatment with such an agent. Altematively, an 
agent identified as described herem can be used in an animal model to determine the mechanism of 
action of such an agent. Furthermore, this invention pertains to uses of novel agents identified by the 
above-described screening assays for treatments as described herem. Preferably, such agents activate 
or enhance CPP biological activity. ' 

The present invention also pertains to uses of novel agents identified by the above-described 
screening assays for diagnoses, prognoses, prevention, and treatments as described herein. 
Accordingly, it is within the scope of the present invention to use such agents in the design, 
formulation, synthesis, manufecture, and/or production of a drug or pharmaceutical composition for 
use in diagnosis, prognosis, or treatment, as described herein. For exanq)le, in one embodiment, the 
present invention includes a method of synthesizing or producing a drug or pharmaceutical 
con5)osition by reference to the structure and/or properties of a compound obtainable by one of the 
above-described screening assays. For example, a drug or pharmaceutical composition can be 
synthesized based on the structure and/or properties of a compound obtained by a method ia which a 
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cell which e?q)resses a CPP target molecule is contacted with a test conqpound and the abUity of the 

test compound to bind to, or modulate the activity o^ the CPP target molecule is determined. In 

another exemplary embodiment, the present invention includes a method of synthesizing or producing 

a drug or pharmaceutical composition based on the structure and/or properties of a compound 

obtainable by a method in which a CPP or biologically active portion thereof is contacted with a test 

compound and the ability of the test compound to bind to, or modulate (e.g., stimulate or inhibit) the 

activity o^ the CPP or biologically active portion thereof is deteraiined. 

Animal based (Amg screening 

It is also advantageous to carry out drug screening assays in vivo. In vivo screening assays 
are carried out in nonhimian aTiimals to discover efifective CPP modulators that may play a role in. 
cardiovascular disease. Aninoal-based model systems of cardiovascular disease include, but are not 
limited to, non-reconibinant animals and transgenic animals. 

Non-recombinant animal models for cardiovascular disease may include, for exantq>le, 
genetic models. Such genetic cardiovascular disease models include apoB or apoR deficient pigs 
(Rapacz, et al., 1986, Science 234:1573-1577) and Watanabe heritable hyperlipidemic (WHHL) 
rabbits (Kita et al., 1987, Proc. Nad. Acad. Sci U.S.A. 84: 5928-5931). Non-recombinant, non- 
genetic animal models of atherosclerosis may include, for exanq)le, pig, rabbit, or rat models in 
which the animal has been exposed to either chemical woundiag through dietary supplementation of 
LDL, or mechanical woimding through balloon catheter angioplasty, for example. 

As indicated in the prior art (Ferns, G. A. A. et al. (1991) Science, 253:1129-1132) the rat 
carotid artery injxuy model of restenosis can be a useful indication of potential therapeutic action. An 
example of this method is described in US Patent 6500859, the disclosure of which is incorporated 
herein by reference. Briefly, the protocol approved by the National Institute on Aging Animal Care 
and use Committee used 6 month Wistar rats firom the GRC colony anesthetized with 20 mg/kg body 
weight pentobarbital, 2 mg/kg body weight ketamine, and 4 mg/kg body weight xylazine 
intraperitoneally . The left external carotid artery was cannulated with 2-French Fogarty 
embolectomy catheter, inflated with saline and passed three times up and down the common carotid 
artery to produce a distending, deendotheliaUzing injury. The animals were treated with an 
appropriate dosage of the test substance or with vehicle alone (e.g., based on body weight per day in 
an appropriate solution such as 1:2:2:165 DMSO:Cremophor EL:Dehydrated ethanol:phosphate 
buffered saline) by intraperitoneal injection beginning 2 hours after injury. Test substance or vehicle 
alone was administered once daily, as an intraperitoneal inj ection, for the next 4 days . After 1 1 days 
the animals (8 treated and 10 vehicle-treated) were anesthetized as above and the carotid artery was 
isolated and fixed in 10% buffered formalin and ^bedded in parafiBn. Cross sections of the carotids 
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were mounted on microscope slides and stained v/iQi hemat03cylni and eosin stain. The image of the 

carotid artery was projected onto a digitizing board and the cross sectional areas of the intima and 

the media were measured. Reduction of Hie neointimal area (thickening) indicates that the test 

substance is an effective antirestenosis agent. 

5 Interfering with the recirculation of bile acids from the lumen of the intestinal tract is foimd 

to reduce the levels of serum cholesterol in a causal relationship. Epidemiological data has 
accumulated which indicates such reduction leads to an improvement in the disease state of 
atherosclerosis (Stedronsky, Biochimica et Biophysica Acta, 1210, 255-287 (1994)). Inhibition of 
cholesteryl ester transfer prtrtein (CETP) has been shown to effectively modify plasma HDL/LDL 

10 . ratios, and is expected to check the progress and/or formation of certain cardiovascular diseases. 

Inhibition of CETP should lead to elevation of plasma HDL cholesterol and lowering of plasma LDL 
cholesterol, thereby providing a therapeutically beneficial plasma lipid profile (McCarthy, Medicmal 
Res. Revs., 13, 139-59 (1993)). An in vivo assay for compounds that inhibit rat ileal uptake of ^"^C- 
• Taurocholate into bile (CETP inhibition) is disclosed in US Patent 6489366 and Une, et al. 

15 Biochimica et Biophysica Acta, 833, 196-202 (1985), disclosures of which are incorporated herein 
by reference. 

Briefly, male Wistar rats (200-300 g) are anesthetized vnOi inactin (100 mg/kg). Bile ducts 
' are cannulated with a lOinch Imgth of PEIO tubing. The small intestine is to be exposed and laid out 
on a gauze pad. A canulae (1/8" luer lock, tapered female adapter) is inserted at 12 cm from the 

20 junction of the small intestine and the cecum. A sUt is cut at 4 an firomthis same junction (utilizing a 
8 cm length of ileum). Twenty milliliters of warm Dulbecco*s phosphate buffered saline, pH 6.S 
(PBS) is to be used to flush out the mtestine segment. The distal op^ung is cannulated with a 20 cm 
length of silicone tubing (0.02" I.D..tim6s.0.037'* O.D.). The proximal cannulae is hooked up to a 
peristaltic punq> and the intestine is washed for 20 min with warm PBS at 0.25 ml/min. Temperature 

25 of the gut segment is to be monitored continuously. At the start of the experiment, 2.0 ml of control 
sample (^"^C-taurocholate at 0.05 mCi/mL widi 5 mM non-radiolabeled taurocholate) is loaded into 
the gut segment with a 3 ml syringe and bile sample collection is begun. Control sanq)le is infiised at 
a rate of 0.25 ml/min fer 21 min. Bile samples fiactions are to be collected every 3 minute for the 
first 27 minutes of the procedure. After the 21 min of ssunple infiision, the ileal loop is to be washed 

30 out with 20 ml of warm PBS (using a 30 ml syringe), and then the loop is to be washed out for 21 
min with warm PBS at 0.25 ml/min. A second perfusion is initiated as described above but this with 
test compound being administered as well (2 1 min administration followed by 21 min of wash out) 
and bile sampled every 3 min for the first 27 min. If necessary, a third perfusion is performed as 
above that typically contains the control sample. 

35 In addition, measurement of hepatic cholesterol concentration is a usefiil assay for 
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detemiming the effectiveness of a test substance against cardiovascular disorders. In this assay, liver 
tissue is weighed and homogenized in cblorofbnn:methanol (2: 1). Afier homogenization and 
CCTtrifugation the supernatant is separated and dried under nitrogen. The residue is to be dissolved in 
isopropanol and the cholesterol content measured enzymatically, using a combination of cholesterol 
5 oxidase and peroxidase, as described by AUain, C. A. et al., Clin. Chem., 20, 470 (1974) (herein 
incorporated by reference). 

Similarly, serum cholesterol may be determined as follows. Total serum cholesterol is 
measured enzymatically using a commercial kit from Wako Fine Chemicals (Richmond, Va.); 
Cholesterol CI 1, Catalog No. 276-64909. HDL cholesterol may be assayed using this same kit after 
10 precipitation of VLDL and LDL with Sigma Chemical Co. HDL Cholesterol reagent. Catalog No. 
352-3 (dextran suljEate method). Total serum triglycerides (blanked) (TGI) is also assayed 
enzymatically with Sigma Caiemical Co. GPO-Trinder, Catalog No. 337-B. YLDJ. and LDL 
(VLDL+LDL) cholesterol concentrations are calculated as the difference between total and HDL 
• diolesteiol. A reduction in VLDL+LDL cholesterol in the test substance-treated sample relative to 
15 control is indicative of an effective anti-cardiovascular disorder agent. 

A dog model for evaluating lipid lowering drugs may also be utilized, for example, as 
described m US Patent 6489366. 

Briefly, male beagle dogs, obtained from a vendor such as Marshall &rms and weighing 6- 
12 kg are fed once a day for two hours and given water ad Ubitum. Dogs may be randomly assigned 
20 to a dosing groups consisting of 6 to 12 dogs each, such as: vehicle, i.g.; 1 mg/kg, i.g.; 2 mg/kg, i.g.; 
4 mg/kg, i.g.; 2 mg/kg, p.o. (powder in capsule). Intra-gastric dosing of a therapeutic material 
dissolved in aqueous solution (for example, 0.2% Tween 80 solution [polyoxyethylene mono-oleate, 
Sigma Chemical Co., St. Louis, Mo.]) may be done using a gavage tube. Prior to initiating dosing, 
blood samples may be drawn from the cephalic vein in the morning before jBseding in order to 
25 evaluate serum cholesterol (total and HDL) and triglycerides. For several consecutive days animals 
are dosed in tiie morning, prior to feeding. Animals are to be allowed 2 hoiirs to eat before any 
remaining food is removed. Feces are to be collected over a 2 day period at the end of the study and 
may be analyzed for bile acid or lipid content. Blood samples are also to be taken, at the end of the . 
treatment period, for comparison with pre-study serum Upid levels. Statistical significance will be 
30 determined using the standard student's T-test with p<0.05 . 

Serum lipid measurement is measured similarly. Blood is collected from the cephalic vein of 
&sted dogs in serum separator tubes (Vacutainer SST, Becton Dickinson and Co., Franklin Lakes, 
N.J.). The blood is centrifuged at 2000 ipm for 20 minutes and the serum decanted. Total diolesterol 
may be measured in a 96 well format using a Wako enzymatic diagnostic kit (Cholesterol CJ£) 
35 (Wako Chemicals, Richmond, Va.), utilizing the cholesterol oxidase reaction to produce hydrogen 
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peroxide which is measured colorimetiically. A standard curve from 0.5 to 10 ug cholesterol is to be 

prepared in the first 2 columns of the plate. The serum samples (20-40 ul, depending on the expected 
lipid concentration) or known serum control samples are added to separate wells in duplicate. Water 
is added to bring the volume to 100 ul in each well. A lOOulaliquotof color reagent is added to each 
5 well and the plates will be read at 500 mn after a 15 minute incubation at 37 degrees centigrade. 

HDL cholesterol may be assayed using Sigma kit No. 352-3 (Sigma Chemical Co., St. 
Louis, Mo.) which utilizes dextran sulfete and Mg ions to selectively precipitate LDL and VLDL. A 
volume of 150 ul of each serum sample is to be added to individual microfixge tubes, followed by 15 
ul of HDL cholesterol reagent (Sigma 352-3). Samples are to be mixed and centrifiiged at 5000 ipm 
10 for 5 minutes. A 50 ul aliquot of the supernatant is to be then mixed with 200 ul of saline and 
assayed using the same procedure as for total cholesterol measurement. 

Triglycerides are measured using Sigma kit No. 337 in a 96 well plate format. This 
procedure will measure glycerol, following its release by reaction of triglycerides with lipoprotein 
lipase. Standard solutions of glycerol (Sigma 339-1 1) ranging from 1 to 24 ug are to be used to 
15 generate the standard curve. Serum samples (20-40 ul, depending on the e?qpected lq)id 

concentration) are added to wells in duplicate. Water is added to bring the volume to 1 00 ul in each 
well and 100 ul of color reagent is also added to each well. After mixing and a 15 minute incubation, 
the plates will be read at 540'nm and the triglyceride values calculated from the standard curve. A 
replicate plate is also to be run using a blank enzyme reagent to correct for any endogenous glycerol 
20 in the serum samples. 

Test compounds may be evaluated for their effect on serum glucose and serum insulin in 
db/db tnice (C578BL/BCsJ-db/db Jcl) as described in US 6462046, disclosure of which is 
incorporated herein. The compounds are dissolved in a vehicle (e.g., consisting of 2% TweenSO in 
distilled water) and administered orally. Dosage is determined by body weight. All aspects of the 
25 work including e>q)erimentation and disposal of the animals is performed in general accordance with 
the International Guiding Principles for Biomedical Research Involving Animals (CIOMS 
Publication No. ISBN 92 90360194, 1985). Glucose-HA Assay kits (Wako, Japan) are used for 
determination of serum glucose and ELISA Mouse Insulin Assay kits (SPI bio, France) are utilized 
for detennination of insuliiL An appropriate positive control is troglitazone (Helios Pharmaceutical, 
30 Louisville, Ky.). 

The animals are divided into twenty groups of four animals each. The aninoals weigh 52 +/- 
5 gms at age 8-10 weeks. Dining the e3q)erim0tri; the animals are provided free access to laboratory . 
chow CFwusow Industry Co., Taiwan) and water. Prior to any treatment a blood sample 
(pretreatment blood) is taken from eadi animal. Four groups of animals, the v^cle groups, receive 
35 only doses of the vehicle. Each of the vehide groups receive 100, 30, 10 or 1 val/kg body weight of 
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the vehicle orally. A triglitazone solution (10 ml/kg body weight in tween 80/water) is administered 
orally to the four positive control groups in doses of 100, 30, 10 and 1 ml/kg body weight 
respectively. The test compound is similarly administered orally as a solution to four groups of 
aniT Tials with each group receiving a different dose of the coiiq)ound. The vehicle, positive control 
and test compound solutions are administered to the groups immediately, 24 hours and 48 hours after 
drawing the pretreatment blood. Blood is withdrawn (post treatment blood) 1.5 hours after 
administration of the last dose. The serum glucose are determined enzymatically (Mutaratose-GOD) 
and the insulin levels by ELISA (mouse insulin assay Idt). The mean SEM of each group is 
calculated and the percent inhibition of serum glucose and insulin obtained by comparison between 
pretreatment blood and post treatment blood. The percentage of reduction of the serum glucose and 
mgnlin levels in the post treatment blood relative to the pretreatment blood is determined and the 

. Unpaired students t test applied for the comparison between the control and test solution groups and 
the vehicle group. A significant difference is ccmsidered at P<0.05. TrogUtazone, as an effective anti- 
cardiovascular disorder agent, results in a reduced glucose level at 10 mg/kg body weight (25 +/- 
2%). . 

US Patent 61213 19, disclosure of which is incorporated herein, describes an assay for the 
progression of atherosclerosis in hypercholesterolmiic rabbits. The rabbits are sacrificed and aortas 
obtained. The aortas are stained with sudan-4 andtiie exteat of staining analyzed. The percent aortic 
surface area covered by lesions in test substance treated and untreated lipid-fed rabbits is graphed. 
The aortas of the rabbits treated with an effective anti-atherosclerotic agent have less staining, 
indicating decreased atherosclerosis. In addition, sections of the aortas are immunostained for 
VCAM-1 e?q)ression or macrophage accumulation using antibodies for VCAM-1 or Ram-1 1 
antigen. Reduced VCAM-1 expression and macrophage accumxilation compared to control treated 
samples are indicative of an effective agent. 

Reduction in LDL cholesterol may also be determined in a primate model. For example, 
Cynomolgus monkeys are made hypercholesterolemic prior to test compound dosing by feeding a 
higji fet cholesterol diet. The monkeys are then dosed orally with the test conq)ound or control 
vehicle for two weeks. A reduction in the percentage serum IDL cholesterol in the monkeys over this • 
- time period is indicative of an effective anti-atherosclerotic agent 

Pharmaceutical Compositions 

When polypeptides of the present invention are expressed in soluble form, for exaxaple as a 
secreted product of transformed yeast or mammalian cells, ikcy can be purified according to standard* 
procedures of the art, including steps of ammonium sul&te precipitation, ion exchange 
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chromatograplq^, gel fittration, electrophoresis, afiSmty chromatography, according to, e.g., ''Enzyme 
Purification and Related Techniques," Methods in Enzymology, 22:233-577 (1977), and Scopes, R., 
Protein Purification: Principles and Practice (Spiinger-Verlag, New York, 1982) provide guidance 
in such purifications. Likewise, \;^en polypeptides of the invention are expressed in insoluble form, 
for example as aggregates or inclusion bodies, they can be piulfied by appropriate techniques, 
including separating the inclusion bodies from disrupted host cells by centrifugation, solubli2mg the 
inclusion bodies with chaotropic and reducing agents, diluting the solubilized mixture, and lowering 
the concentration of chaotropic agent and reducing agent so that the polypq)tide takes on a 
biologically active conformation. The latter procedures are disclosed in the following references, 
which are incorporated by reference: Winkler et al. Biochemistry, 25: 4041-4045 (1986); Winkler et 
al. Biotechnology, 3: 992-998 (1985); Koths et al, U.S. patent 4,569,790; and European patent 
appUcations 86306917.5 and 86306353.3. 

Compounds capable of modulating and preferably increasing a CPP biological activity, 
preferably small molecules but also including peptides, CPP nucleic add molecules, CPP, and anti- 
CPP antibodies of the invention can be mcorporated into pharmaceutical compositions suitable for 
administration. Such compositions typically comprise a pharmaceutically acceptable carrier. As used 
herein the language "pharmaceutically acceptable carrier" is intended to include any and all solvents, 
dispersion media, coatings, antibacterial and antifungal ag^its, isotonic and absorption delaying 
agents, and the like, conopatible with pharmaceutical administration. The use of such media and 
agents fer pharmaceutically active substances is well known in the art. Except inso&r as any 
conventional media or agent is incompatible with the active cQnq)ound, use thereof in the 
compositions is contemplated. Supplementary active compounds can also be incorporated into the 
conqpositions. 

A pharmaceutical composition of the invention is formulated to be compatible with its 
intended route of administration. Examples of routes of administration include parenteral, e.g., 
intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (topical), transmucosal, 
and rectal administration. Solutions or suspensions used for parenteral, intradermal, or subcutaneous 
appUcation can include the following components: a sterile diluent such as water for injection, saline 
solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic add or 
sodium bisulfite; chelating ag&ais such as ethylenediauMnetetraacetic acid; buffers such as acetates, 
citrates or phosphates and agents for the adjustmoit of tonicity such as sodium chloride or dextrose. 
pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The 
parenteral preparation can be enclosed in an5)oules, disposable syringes or multiple dose vials made 
of glass or plastic. 
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Phannaceutical compositLons suitable for injectable use include sterile aqueous solutions 

(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of 

sterile injectable solutions or dispersion. For intravenous administration, suitable carriers include 

physiological saline, bacteriostatic water, Cranophor EL® (BASF, Parsippany, N.J.) or phosphate 

buffered saline (PBS). In all cases, the composition must be sterile and should be fluid to the extent 

that easy syringability exists. It must be stable under the conditions of manufecture and storage and 

must be preserved against the contaminating action of microorganisms such as bacteria and fimgi: 

The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol 

(for example, glycerol, propylene glycol, and liquid polyethylme glycol, and the like), and suitable 

mixtures thereof The proper fluidity can be maintained, for example, by the use of a coating such as 

lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of 

sur&ctants. Prevention of the action microorganisms can be adiieved by various antibacterial and 

anti&ngal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the 

Uke. Jn many cases, it will be preferable to include isotonic agents * for example, sugars, polyalcohols ' 

such as manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 

compositions can be brought about by including in the conq}osition an agent which delays 

absorption, for exanq>le, aluminum monostearate and gelatin. 

Where the active compound is a protein, e.g., an anti-CPP antibody, sterile injectable 
solutions can be prepared by incorporating the active compound in the required amount in an 
appropriate solvent with one or a combination of ingredients enumerated above, as required, 
followed by filtered sterilization. Generally, dispersions are prepared by incorporating the active 
compound into a sterile vehicle which contains a basic dispersion medium and other required 
ingredients firom those eniunerated above. In the case of sterile powders for the preparation of sterile 
injectable solutions, the preferred mefliods of preparation are vacuum drying and fireeze-drying which 
.yields a powder of the active ingredient plus any additional desired ingredient fi-om a previously 
sterile-filtered solution thereof 

Oral compositions generally include an inert diluent or an edible carrier. They can be 
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic 
administration, the active compound can be incorporated with excipients and used in the form of 
tablets, troches, or capsules. For administration by inhalation, the compounds are deUvered in the 
form of an aerosol spray firom pressured container or dispenser which contains a suitable propellant, . 
e.g., a gas such as carbon dioxide, or a nebulizer. Systemic administration can also be by 
transmucosal or transdermal means. For transmucosal or transdermal administration, pexietrants 
^propriate to the barrier to be permeated are used in the fi>nnulation. Such penetrants are generally 
known in the art, and include, for e3cample, for transmucosal administration, detetgents, bile salts, 
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and fusidic acid derivatives. Traosmucosal adiniiiistratioii can be accomplished through the use of 
nasal sprays or suppositories. For transdermal administradon, the active compounds are formulated 
into ointments, salves, gels, or creams as generally known in the art. Most preferably, active 
compound is delivered to a subject by intravenous injection. 
5 la one embodiment, the active compounds are prepared with carriers that will protect the 

compound against rapid elimination from the body, such as a controlled release formulation, 
including implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers 
can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, 
polyorthoesters, and polylactic acid. Methods for preparation of such formulations will be apparent 

10 to those skilled in the art. The materials can also be obtained commercially from Alza Corporation 
and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically acceptable 
carriers. These can be prepared according to methods known to those skilled in the art, for exanxple, 
as described in U.S. Pat. No. 4,522,81 1, the disclosure of which is incorporated herein by reference 

15 in its entirely. 

In a fiuther embodiment, the active CQnq>ound may be coated on a microchip drug delivery . 
device. Such devices are usefrd for controUed delivery of proteinaceous compositions iri^ 
bloodstream, cerebrospinal fluid, lynqjh, or tissue of an individual without subjecting such 
conq)ositions to digestion or subjecting the individual to injection. Methods of using microchip drug 

20 delivery devices are described in US Patents 6,123,861 and 5,797,898 and US Patent application 
200201 19176A1, disclosures of which are hereby incorporated in their entireties. 

It is especially advantageous to fr>rmulate oral or preferably parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be treated; each 

25 unit containing a predetermined quantity of active compound calculated to produce the desired 

therapeutic effect in association with the required pharmaceutical carrier. The specification for the 
dosage unit forms of the invention are dictated by and directiy dependent on the imique 
characteristics of the active compound and the particular therapeutic ejffect to be achieved, and the 
limitations inherent in the art of compounding such an active compound for the treatment of 

30 individuals. 

Toxicity and therapeutic efl&cacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimtotal animals, e.g., for determining the LDSO 
(the dose lethal to 50% of the population) and the BD50 (the dose therapeutically effective in 50% of 
the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it 
35 can be e:jq>ressed as the ratio LD50/ED50. Compounds which eriiibit large tiierapeutic indices are 
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preferred. While compoimds that exhibit toxic side e£Eects may be used, care should be taken to 
design a delivery system that targets such compounds to the site of affected tissue in order to 
miniinize potential damage to uninfected cells and, thereby, reduce side eflfe^ « . 

The data obtained from the cell culture assays and animal studies can be used in formulating 
a range of dosage for use in humans. The dosage of such compounds lies preferably within a range 
of circulating concentrations that include the ED50 with little or no toxicity. The dosage may vary 
within this range depending upon the dosage form employed and the route of administration utilized. 
For any compound used in the mefliod of the invention, the therapeutically eJ&ctive dose can be 
estimated initially from cell culture assays. A dose may be formulated in animal models to achieve a 
circulating used to more accurately determine useful doses in humans. Levels in plasma may be 
measured, for example, by high performance liquid chronaatography. 

The phamiaceutical compositions can be included in a container, pack, or dispenser together 
with instructions for administratiQn. 

Therapeutic Uses ofCPPs 

The CPPs, CPP modulators, and anti-CPP antibodies of the invention can be used in the 
treatment or prevention of CPP-related disorders. Unlike manufectured small molecule therapeutics 
that may cause an immune reaction or prove to be toxic to tiie host, thepq)tides of the invention are 
normaUy secreted and are thought not to be toxic to a subject to vdiicht^ Thus, 
in one aspect the invention relates to pharmaceutical compositions containing CPP as active 
ingredients, preferably containing a pharmaceutically acceptable carrier or diluent. The carrier or 
diluent is preferably adapted for oral, intravenous, intramuscular or subcutaneous administration. 
CPPs can also be prepared as solutions for inhalation. Phamiaceutical compositions may conqprise, 
consist or consist essentially any of the CPPs described herein. Optionally, a CPP can be 
administered as a propeptide or prepropeptide. 

A number of agents are useful for the treatment and prevention of cardiovascular disorders. 
Such agents may be used advantageously in combination with a CPP. 

For example, cell cycle inhibitors and proto-oncogenes (Simari and Nabel, Semin. Intervent. 
Cardiol. 1:77-83 (1996)); NO (nitric oxide) donor drugs; pro-apoptotic agents such as bcl-x 
(Pollman et al.. Nature Med. 2:222-227 (1998)); herpes virus thymidine kinase (tk) gene and 
systemic ganciclovir (Ohno et al.. Science 265:781-784 (1994); Guzman et al., Proc. Nafl. Acad. 
Sd. USA 91:10732-10736 (1994); Chang et al., Mol. Med. 1:172-181 (1995); and Simari et al.. 
Circulation 92:1-501 (1995)) have been ©qiloited to treat atherosclerosis, restenosis and neointimal 
smooth muscle proliferation. Disclosures of the above references are hereby incorporated in their 
entireties. 
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Anti-thrombotic agents useful in combination with tiie peptides of the invention include, for 
example, inhibitors of the Hb/IIIa integrin; tissue fector inhibitors; and anti-thrombia agents. An 
antianhythmic agent, such as a local anesthetic (class I agent), sympathetic antagonist (class n 
agent), antifibrillatory agent (class T£L agent) calcium channel agent (class IV agent) or anion 
5 antagonist (class V agent) as described in Vufcmir, Am. J. Emer. Med. 13:459-470 (1995); Grant, 
PACE 20:432-444 (1997); Assmann L, Curr. Med. Res. Opin. 13:325-343 (1995); and Lipka et al., 
• Am. Heart J. 130:632-640 (1995), disclosures of which are hereby incoiporated by reference in their 
entireties, may also be used. Exanq)les of class I agents include: procainamide; quinidine or 
disopyramide; Udocaine; phenj^in; tocainide or mexiletine; encainide; flecainide; lorcainide; 
10 propafenone (HI) or moricizine. Sympathetic antagonists include: propranolol, esmolol, metoprolol, 
atenelal, or acebutolol. Examples of antifibrillatory agents are bretylium, amiodarone, sotalol (U) or 
N-acetylprocainamide. Class IV agents include verapamil, diltiazem, and bepridil, and anion 
antagonists such as alinidine. 

Congestive heart feulure therapeutic agents include TNF inhibitors such as Embrel.TM. 

15 (Immunex Corp.; Seattle, Wash.), TBCl 125 1, or an ACE (angiotensin converting enzyme) inhibitor, 
such as Natrecor (nesiritide; Scios, Inc.). Angiogenic agents, for example, recombinant VEGF 
isoforms, such as rhVEGF developed by Genentech; a nucleic acid molecule encoding the 121 amino 
acid isoform of VEGF (BioByPass.TM.; GenVec/Parke Davis); or a nucleic add encoding VEGF-2 
(Vascular Genetics, toe); FIBLAST.TM., a recombinant form of FGF-2 being developed by Scios, 

20 Inc. (Mountain View, Calif.) and Wyeth Ayerst Laboratories (Radnor, Pa.), GENERX.TM., or an 
adenoviral gene therapy vector encoding FGF-4 developed by Collateral Therapeutics (San Diego, 
Calif) and Schering AG (see Miller and Abrams, Gen. Engin, News 18:1 (1998), disclosure of 
which is hereby incorporated by reference in its entirety), are also useful in combination with the 
CPP-related compositions of the invention. Finally, calcium antagonists, such as amlodipine 

25 (Marche et al., lot. J. Cardiol. 62(Suppl.):S17-S22 (1997); Schachter, Jnt. J. Cardiol. 

62(Suppl.):S85-S90 (1997)); nicardipine; nifedipine; propanolol; isosorbide dinitrate; diltiazem; and 
isradipine (Naylra- (Ed.) Calcium Antagonists pages 157-260 London: Academic Press (1988); 
Schachter, Int. J. Cardiol. 62(Suppl.):S9-S15 (1997)) are also advantageous therapeutic agents for 
cardiovascular disorders. 

30 

Ebving generally described this invention, a further understanding can be obtained by 
reference to certain specific examples which are provided herein for puiposes of iUustmtion only, 
and are not intended to be limiting unless otherwise specified. 
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Example 1: Characterization of CPP levels in experimental and control populations 

Subjects enrolled in the Duke Databank for Cardiovascular Disease were selected on the 
basis of coronary artery disease (CAD). A total of 24 1 CAD patients and control individuals were 
further matched for gender, age, and ethnicity and individuals with plasma abnormalities were 
excluded. A set of 53 CAD patients and a set of 53 control individuals were established. Six Uters 
of plasma were pooled from each set An aUquot of plasma was retained from each individual, thus 
allowing a positive result in the pooled sample to be confirmed for each member of the population. . 
Such confirmation is \^uable to erase possible confomding eflFects of an individual with an aberrant 
level of a specific polypqrtide that is not related to a cardiovascular disorder. Two and a half liters 
of pooled plasma from each population was subjected to separation by multiple chrcnnatography 
steps according to the Microprot.TM process as follows: 

Step 1: HSA/IgQ depletion 

125 ml frozen plasma were defrost and fittered on 0.45 pm sterile filter m a sterile hood. 

Filtrate was injected on two inline columns of respectively 300 ml of HSA ligand Sepharose 
fast Flow column (Amersham, Upsala, Sweden), 5cm ID, 15 cm length; and 100 ml Protein G 
Sepharose fest Flow coltmm (Amersham, Upsala, Sweden), 5 cm ID, 5 cm length. 

Columns were equiUbrated and washed with 50 mM P04 buffer, pH 7. 1, 0. 15M NaCl. Flow 
rate was 5 ml/min. 

Non-retained fraction (350 ml) was frozen until second step. Twenty runs were performed. 

Step 2: Gel Filtration /Reverse Phase Ccpture step 

Saixq}le from step 1 was defrosted and filtered on 0.45 pm sterile filta: in a sterile hood. 

Filtrate was injected on two in line gel filtration columns: 2 X 9.5 litres Superdex 75 
(Amersham, UK) coluhm, 14 an ID, 62 cm length. Column was equilibrated with 50mM P04 buffer 
pH 7.4, 0.1 M NaCl, 8M urea. Hydrophobic impurities were retained on a reverse phase precolumn: 
150 ml PLRPS (Polymer Labs, UK). Precolumn was switched for san^le injection. Gel filtration 
was perfonned at a flow rate of 40 ml/min. 

Low molecular weight proteins (<20 kDa) were oriented to in line reverse phase capture 
column: 50 ml PLRPS 100 angstroms (Polymer labs, UK). The three-way valve controlling injection 
on PLRPS column was switched at a cut-off of 33 mAU (280 nm) to send gel filtration eluate into 
reverse phase capture column. This cut-off value wais established by first using SDS-PAGE to 

84 



wo 2004/061096 PCT/EP2004/000027 
provide an estimated range of OD values and by subsequently evaluating three cut-off values (high, 
median and low values of OD range). The final cut-off value was chosen to maximize the low 
molecular weight protein obtained, with a low molecular protein proportion of at least 85%. Low 
molecular weight proteins and peptides were eluted from reverse phase capture PLRPS column by 
one column volume gradient of 0. 1% TFA, 80% CH3C3>T in water. 

Eluate fractions (50 ml) were frozen until next step. Twenty runs were performed. At the end of this 
stq), all reverse phase eluates were defrosted, pooled (1 liter) and shared in 7 polypropylene 
containers (143 ml). Containers were kept at -20**C until use for next step. 

Step 3: Cation Exchange 

Sample from step 2 (147 ml) was defrosted and mixed with an equal volume of cation 
exchaoge buffer A (Gly/HCl buffer 50 mM, pH 2.7, urea 8M). 

Sample was injected on a 100 ml Source 15S column (Amersham, Upsala, Sweden), 35 mm 
ID, 100 mm length. Column was equiUbrated and washed w^ lOml/min. 

Proteins and peptides were eluted with step gradient frcnn 100% buffer A until 100 % buffer 
B (buffer A cOTtaining IM NaCl): 

3 column volumes 7.5% B (75 mM NaCl) 

3 colunm volumes 10% B (100 mM NaCl) 

3 column volumes 17.5% B (175 mMNaCl) 

2 column volumes 22.5% B (225 mM NaCl) 

2 column volumes 27.5% B (275 mM NaCl) 

2 column volumes 100% B (1 M NaCl) 

45 to 60 fractions were collected based on peak. Seven runs were conducted. After 7 runs 
were achieved, fractions were pooled intra and inter run in order to obtain 18 fractions. Fractions 
were kept at -20Xuntil use for next step. . 

Step 4: Reduction/AlkyUOion and Reverse Phase HPLC Fractionation 1 

After adjusting the pHto 8.5 with concentrated Tris-HCl, each of the 18 cation exchange 
fractions was reduced with dithioerythritol (DTE, 30 mH ' 
iodoacetamid (120 mM, I hour 25^C in the dark). The latter reaction was stopped with the addition 
of DTE (30 mM) followed by acidification (TFA, 0.1%). The fractions were then injected on an 
Uptispher C8, 5 microm, 300 angstroms column (Interchim, France), 21 mm ID, 150 mm length. 
Injection was performed with a 10 ml/min flow rate. 

C8 column was equilibiated and washed with 0. 1% TFA in water (solution A). Proteins and 
peptides were eluted with a biphasic gradient from 100% A until 100% B (0.1% TFA, 80% CH3CN 
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in water) in 60 min. Flow rate 20 ml/min. Thirty ftactions of 40 ml were collected. 

Based on the measured qptical density (OD) at 280 nm of each fraction, which reflects the 
protein concentration in that fiaction, aliquots of siinilar protein content were created for each 
fraction. 

5 All aliquots were frozen and kept for further use cxc&pt one per fraction which was dried 

with a Speed Vac (Savant, Fischer, Geneva) after addition of 500 microl 10% glycerol in water in 
each fraction, in order to prevent excess drying. Dried fractions were kq>t at -20**C until use for next 
step. 

10 Step 5: Reverse Phase HPLC Fractionation 2 

Dried samples from stq) 4 were resuspended in 1 ml of solution A (0.03% TFA in water) 
and injected on a Vydac LCMS C4 column, 5 micrometers, 300 angstroms (Vydac, USA), 4.6 imn 
ID, 150 mm length. Flow rate was 0,8 ml/min. 

C4 column was equilibrated and washed with solution A and proteins and peptides were 

15 eluted with a biphasic gradient ad^ted to elution position of the sanq)le in Reverse Phase HPLC 
Fractionation 1 . Mact mass data were acquired using Electrospray Ion Trap Mass spectrometry. 
Sixteen difGsrent gradients were used with a CH3CN concentration range minus and plus 5% 
CH3CN of RPl fraction corresponding solv&A conceotration. For proteins eluted in RPl with a 
solvent concentration equal to or greater than 30% CH3CN, the starting elution conditions for the 

20 RP2 gradient was set, in CH3CN percentage, at the RPl elution concentration minus 30%. Twenty- 
four eluted fractions were collected in a deep well plate, adopting optimized difJferent collection 
configurations designed for optimal SpeedVac concentration and further robotic treatment. 

Step 6: Mass detection 

25 About 13,000 fractions were collected following reverse phase HPLC fractionation 2 into 

96-well deep well plates (DWP). A small proportion (2.5%) of the volume was diverted to online 
analysis using LC-ESI-MS (Bruker Esquire). Aliquots of undigested proteins were mixed with 
MALDI matrices, and spotted on MALDI plates together with mass calibration standards and 
sensitivity standards. Automated spotting devices (Bruker MALDI sample prep. Robots) were used. 

30 Two different MALDI matrices were employed: sinapic add (SA), also known as sinapuiic add, 
trans-3,5-<limedio>g^-4-liydroxycinnamic add, and alpha-cyano-4-hydro:Q^dnnannc add (HCCA). 
MALDI plates were subjected to mass detection using Bruker Reflex III MALDI MS appaiati. The 
96-well plates were stored at +4 C. 

96-well plates (DWP) were recovered and subjected to two sequential concentration steps. 

35 Volumes were concentrated from 0.8 ml to about 50 microl per well by drying with a SpeedVac, and 
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then resolubilized to ca. 200 microl and reconcentrated to aboxrt 50 microl per well, and stored at +4 
C, Proteins were then digested by re-bufifering, adding trypsin to the wells, sealing and incubating 
the plates at 37 C for 12 hours, followed by quenching (addition of foimic acid to bring the pH down 
to 2.0). The concentration of trypsin to be added to the wells was adjusted based on the OD at 280 
nm recorded for each particular fraction. This msured an optimal use of trypsin and a complete 
digestion ofthe most concentrated fractions. Automated spottiiig devices (Bruker MALDI sample 
prep. Robots) were used to deposit a volume from each well, pre-mixed with a HCCA matrix onto a 
MALDI plate together with sensitivity and mass calibration standards. MALDI plates were 
analyzed using a Bruker Reflex m MALDI MS device. Contents from each well of tiie 96 well 
plates were analyzed with LC-ESI-MS-MS Bruker Esquire ESI lon-Tiap MS devices. 

Step 7: Detection and Identification of Low Abundance Peptides in Human Plasma 

Separated fractions are fiuther subjected to mass spectrometry (both matrix-assisted laser 

desorption/ionization (MALDI) and MS-MS) for separation and detection. 

Litact mass data. Peptide Mass Fipgerprints and peptide sequence data were integrated for 

protein identification and characterization. Proteins were identified using Mascot software (Matrix 

Science Ltd., London, UK), and results from peptide identification were checked by manual analysis 

of the spectra. 

Among the proteins identified by this process, CalgranuUn A (S 100 calcium-binding protein 
A8, of SwissProt accession number P05 109), was found to be expressed to a greater extent in the 
pooled sample from controls than in the pooled sample fix)m CAD patients (e.g., peptides from the 
protein were observed in twice as many control fractions compared with disease fractions, and the * 
cumulated scores obtained during mass spectra identification of this protein were 2.5-fold higher for 
the control sample). Calgranulin A has been characterized as a pro-inflammatoiy protein (Odink, et 
al.. Nature 330 (6143), 80-82 (1987) and numerous later references). It is expressed by 
extravasating myeloid cells during inflammatoiy responses, where it binds to a glycosaminoglycan 
structure on epithelial cells CRobinson, et al., JBC 277:3658-65 (2002)). Interestingly, PCT 
publication WO 00/61742 discloses the use of Calgranulin A for the treatment of carfiac 
insuflBLciency, e.g. caused by arteriosclerosis. Moreover, PCT publication WO 00/18970 discloses 
the use of Calgranulin A as an inhibitor of vascular membrane growth for prevention of myocardial 
tn&rction and hypertension. It appears therefore that the protein separation and identification 
^proach described herein is efficient at proviidix^ proteins which, when detected at higher levels in 
the control sample than in the disease sanqple, have a benefidaiy efifect for the treatment ofthe 
studied disease. 

Conversely, the methods of protem separation and identification described in this Exanq)le 
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have allowed the ideatificadon of the Matrix Gla Protein (of SwissProt accession numbCT P08493) 
as overe>q)ressed in the pooled sanq)le from CAD patients by comparison wifli the pooled sample 
from controls (e.g., peptides from the protem were observed m almost twice as many disease 
fractions compared with control fractions, and the cumulated scores obtained during mass spectra 

5 identification of this protein were 2-fold higher for the disease sample). MGP is a vitamin K- 
dependent protein winch associates with the organic matrix of bone and cartilage. Mori, et al. 
demonstrated that MGP is capable of inhibiting vascular calcification (FEBS Letters 433 : 19-22 
(1998)). MGP levels are increased in atherosclerotic plaques as a likely feedback response to vessel 
calcification. PCT publications WO 01/02863 and WO 01/25427 describe MGP as a biomarker for 

10 atherosclerosis and cardiovascular disorders. It appears therefore that the protein separation and 
identification approach described herein is efficient at providing proteins which have a recognized 
use in the diagnosis of the studied disease. 

• - Example 2: Purification and Characterisation of CPPs 

15 Human plasma was firactionated by a series of multiple chromatography columns as desoibed in 
Exanq)le 1. The fractions were analysed by mass spectrometry. A distinct tryptic fragment was 
found as described in Table 1 and SEQ ID NO:5. 

The tryptic peptide listed in Table 1 (SEQ ID NO:5) was observed by MS-MS in the plasma 
of control uidividuals without Coronary Artery Disease. CPP peptides presmt in the plasma of CAD 

20 patients include CPPs 6 and 7 of SEQ ID NOs:3 and 4. The fiiU-length polypeptide sequences 

correlated with the tryptic peptide are described as SEQ ID NOs:l and 2. The listed peptides were 
not found in the CAD sample. The Microprot.TM process is able to detect very low abundance 
proteins with a plasma concentration in the range of 50 pM. Thus, the absence of the listed peptides 
in CAD plasma indicates that the CPPs are present at vanishingly low levels in CAD patients, if at 

25 all. 

The presence of a tryptic peptide indicates that a polypeptide with a molecular mass below 
20kD and comprising the amino acid sequence FTTLVQDLANAFQQEAQTSGK was present in the 
starting human plasma saniple. Such polypeptides include those comprising the amino acid sequence 
selected from the group consistmg of SEQ ID NOs:3-4. 
30 . ' 

Example 3: Chemical Synthesis of CPPs 

In this example, a CPP of the invention is synthesized. Peptide fragmdit intermediates are 
first synthesized and then assembled into the desired polypeptide. 

A CPP can initially be prepared in, e.g. 5 fiiagments, selected to have a Cys residue at the N- 
35 terminus of the fragment to be coupled. Tragment 1 is initially coupled to fragment 2 to give a first 
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product, then after preparative HPLC purification, the first product is coupled to firagment 3 to give 
a second product. After prq)arative HPLC purification, the second product is coupled to firagment 4 
to give a third product. Finally, after preparative HPLC purification, the third product is coupled to 
fragment 5 to give the desired polypeptide, which is purified and refolded. 

Thioester formation 

Fragments 2, 3, 4, and 5 are synthesized on a thioester generating resin, as described above. 
For this puipose the following resin is prepared: S-acetylthioglycolic acid pentafluorophenylester is 
coupled to a Leu-PAM resin under conditions essentially as described by Hackeng et al (1999). In 
the first case, the resulting resin is used as a starting resin for peptide chain elongation on a 0.2 mmol 
scale after removal of the acetyl protecting group with a 30 mm treatment wifli 10% 
mercaptoethanol, 10% piperidine in DMF. The N" of the N-terminal Cys residues of firagments 2 
through 5 are protected by couplmg a Boc-thioproline (Boc-SPr, i.e. Boc-L-thioproline) to the 
texminus of the respective chains instead of a Cys having conventional N** or protection, e.g. Brik 
et al, J. Org. Chem., 65: 3829-3835 (2000). 

Peptide synthesis 

Solid-phase synthesis is performed on a custom-modified 433A peptide synthesizer from 
^pUed Biosystems, using in situ neutralization/2-(lH-ben20triazol-l-yl)-l, 1,1,3,3- 
20 tetramethyluronium hexafluoro-phosphate (HBTU) activation protocols for stepwise Boc chemistry 
chain elongation, as described by Schnolzer et al. Int. J. Peptide Protein Res,, 40: 180-193 (1992). 
Each synthetic cycle consists of N**-Boc -removal by a 1 to 2 min treatment with neat TF A, a 1 -min 
DMF flow wash, a 10-min coupling time with 2.0 mmol of preactivated Boc-amino acid in the 
presence of excess DIEA and a second DMF flow wash. Na-Boc-amino adds (2 mmol) are 
25 preactivated for 3min with 1 . 8mmol HBTU (0.5M m DMF) in the presence of excess DIEA 
(6mmol). After coupling of Ghi residues, a dichloromethane flow wash is used before and after 
deprotection usmg TFA, to prevent possible high tenq)erature (TFA/DMF)-catalyzed pyrrolidone 
carboxyUc acid fijnnation. Side-chain protected amino acids are Boc-Arg(p-tolufinesulfonyl)-OH, 
Boc-Asn(xanthyl)-OH, Boc-Asp(0-cyclohexyl)-OH, Boc-Cys(4-methylbenzyl)-OH, Boc-Glu(0- 
30 cyclohexyl)-OH, Boc-His(dinitrophenylbenzyl)-OH, Boc-Lys(2-Cl-Z)-OH, Boc-Ser(benzyl)-OH, 
Boc-Thr(benzyl)-OH, Boc-Trp(cyclohexylcarbonyl)-OH and Boc-Tyr(2-Br-Z)-OH (Orpagen 
Phanna, Heidelberg, Germany). Other amino acids are used without side chain protection. C- 
tenninal Fragment 1 is synthesized on Boc-Leu-0-CH2-Pam resm (0.71mmol/g of loaded resin), 
while for Fragments 2 through 5 machine-assisted synthesis is started on the Boc-Xaa-S-CHa-CO- 
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Leu-Pam resin. This resin is obtained by the coupling of S-acetyMhioglycolic acid 
pentafluorophenylester to a Leu-PAM resin under standard conditions. The resulting resin is used as 
a starting resin for peptide chain elongation on a 0.2 mmol scale after removal of the acetyl 
protecting group witii a SOmin treatment with 10% mercaptoethanol, 10% piperidine in DMF. 

After chain assembly is completed, the peptide fragments axe dqjrotected and cleaved from 
the resin by treatment with anhydrous hydrogen fluoride for Ihr at C'C with 5% p-cresol as a 
scavenger. In all cases except Fragment 1, the imidazole side chain 2,4-dinitrophenyl (DNP) 
protecting groups remain on IBs residues because the DNP-removal procedure is incompatible with 
C-terminal thioester groups. However DNP is gradually removed by thiols during the ligation 
reaction, yielding unprotected His. After cleavage, peptide fragments are precipitated with ice-cold 
diethylether, dissolved in aqueous acetonitrile and lyophilized. The peptide fragments are purified by 
RP-HPLC with a C18 column from Waters by using linear gradients of buffer B (acetonitile/0. 1% 
trifluoroaoetic acid) in buffer A CH2O/0. 1% trifluoroacetic acid) and UV detection at 214nm. 
Samples are analyzed by electrospray mass spectrometry (ESMS) using an Esquire instrument 
(Brucker, Bremen , Germany),.or like instrument. 

Native chemical ligations 

As described more fiilly below, tiie ligation of unprotected fragmmts is perfonned as 
follows: the dry peptides are dissolved in equimolar amounts in 6M guanidine hydrochloride 
. (GuHCl), 0.2M phosphate, pH 7.5 in order to get a final peptide concentration of 1-8 mM at a pH 
around 7, and 1% benzyhnercaptan, 1% thiophenol is added. Usually, the reaction is carried out 
overnight and is monitored by HPLC and electrospray mass spectrometry. The Ugation product is 
subsequentiy treated to remove protecting groups still present. Opening of the N-terminal thiazolidine 
ring further required the addition of solid methoxamine to a 0.5M final concentration at pH3 .5 and a 
further incubation for 2h at 37°C. A 10-fold excess of Tris(2-carboxyetiiyl)phosphine is added 
before preparative HPLC purification. Fractions containing the polypeptide chain are identified by 
ESMS, pooled and lyophilized. 

The ligation of fragments 4 and 5 is performed at pH7.0 in 6 M GuHCl. The concentration 
of each reactant is 8mM, and 1% benzyhnercaptan and 1% thiophenol were added to create a 
reducing environment and to fecilitate the ligation reaction. An almost quantitative ligation reaction 
is observed after overnight stirring at 37X. At this point in the reaction, dHa-O-NHzHCl is added 
to the solution to get a 0 .5M final concentration, and the pH adjusted to 3 .5 in order to open the N- 
terminal thiazolidine ring. After 2h incubation at 37X, ESMS is used to confirm the conq)letion of 
the reaction. The reaction mixture is subsequently treated with a 10-fold excess of Tris(2- 
caiboxyethylphosphine) over the peptide fiagment and after ISmin, the ligation product is purified 

90 



wo 2004/061096 PCT/EP2004/000027 
using the preparative HPLC (e.g., C4, 20-60% CMaCN, 0.5% per min), lyophilized, and stored at - 

20*>C. 

The same procedure is repeated for the remaining ligations wiSi slight modifications. 



5 Polypeptide Folding 

The full length peptide is refolded by air oxidation by dissolving the reduced lyophilized protein 
(about 0. 1 mg/mL) in IM GuHCl, lOOmM Tris, lOmM methionine, pH 8.6, After gentle stirring 
overnight, the protein solution is purified by RP-HPLC as described above. 

. 10 Example 4: Preparation of CPP antibody compositions 

Substantially pure CPP or a portion thereof is obtained. The concentration of protein in the 
final preparation is adjusted, for example, by concentration on an Amicon filter device, to the level of 
a few micrograms per ml. Monoclonal or polyclonal antibodies to the protein are than prepared as 
described in the sections titied 'TMonoclonal antibodies" and *Tolyclonal antibodies." 

15 Bri^y, to produce an anti-CPP monoclonal antibody, a mouse is repetitively inoculated with a few 
micrograms of the CPP or a portion thereof over a period of a few weeks. The mouse is then 
sacrificed, and the antibody producing cells of the spleen isolated. The spleen cells are fiised by 
means of polyethylene glycol with mouse myeloma cells, and the excess unfiised cells destroyed by 
growth of the system on selective media conq>rising anoinopterin (HAT media). The successfiilly 

20 fiised- cells are diluted and aliquots of the dilution placed in wells of a microtiter plate where growth 
of the culture is continued. Antibody-producing clones are identified by detection of antibody in the 
supernatant fluid of the wells by immunoassay procedures, such as ELIS A, as originally described 
by Engvall, E., Meth. Enzymol. 70: 419 (1980), the disclosure of which is incorporated herein by 
reference in its entirety. Selected positive clones can be expanded and tiieir monoclonal antibody 

25 product harvested for use. Detailed procedures for monoclonal antibody production are described in 
Davis, L. et al. Basic Methods in Molecular Biology Elsevier, New York. Section 21-2, the 
disclosiure of which is incorporated herein by reference in its entirety. 

For polyclonal antibody production by immunization, polyclonal antiserum containing 
antibodies to heterogeneous epitopes in the CPP or a portion tiiereof are prepared by mtmvmimx^ a 

30 mouse with the CPP or a portion thereoi^ which can be unmodified or modified to enhance 

immunogenicity. Any suitable nonhuman animal, preferably a nQn-human mannnal m^y be selgctgd 
including rat, rabbit, goat, or horse. 

Antibody preparations prepared according to either the monoclonal or the polyclonal 
protocol are usefiil in quantitative immunoassays which determine concentrations of CPP in 

35 biological samples; or they are also used semi-quantitatively or qualitatively to identify the presence 
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of antigen in a biological sample. The antibodies may also be used in therapeutic conq>ositions for 
killing qells e>q)ressing the protein or reducing the levels of the protein in the body. 
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